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Preface 


The Point of this Book 


How and what should we teach today’s undergraduates to prepare them for careers in mathemat- 
ically oriented areas? Furthermore, how can we ameliorate the quantum leap from introductory 
calculus and linear algebra to more abstract methods in both pure and applied mathematics? 
There is a subject which can take students of mathematics to the next level of development and 
this subject is, at once, intuitive, calculable, useful, interdisciplinary and, most importantly, inter- 
esting. Of course, I’m talking here about Differential Geometry, a subject with a long, wonderful 
history and a subject which has found new relevance in areas ranging from machinery design to 
the classification of four-manifolds to the creation of theories of Nature’s fundamental forces to 
the study of DNA. 

Differential Geometry provides the perfect transition course to higher mathematics and its 
applications. It is a subject which allows students to see mathematics for what it is — not 
the compartmentalized courses of a standard university curriculum, but a unified whole mixing 
together geometry, calculus, linear algebra, differential equations, complex variables, the calculus 
of variations and various notions from the sciences. Moreover, Differential Geometry is not just 
for mathematics majors, but encompasses techniques and ideas relevant to students in engineering 
and the sciences. Furthermore, the subject itself is not quantized. By this, I mean that there is 
a continuous spectrum of results that proceeds from those which depend on calculation alone 
to those whose proofs are quite abstract. In this way students gradually are transformed from 
calculators to thinkers. 

Into the mix of these ideas now comes the opportunity to visualize concepts and constructions 
through the use of computer algebra systems such as Maple and Mathematica. Indeed, it is 
often the case that the consequent visualization goes hand-in-hand with the understanding of 
the mathematics behind the computer construction. For instance, in Chapter 5, I use Maple 
to visualize geodesics on surfaces and this requires an understanding of the idea of solving a 
system of differential equations numerically and displaying the solution. Further, in this case, 
visualization is not an empty exercise in computer technology, but actually clarifies various 
phenomena such as the bound on geodesics due to the Clairaut relation. There are many other 
examples of the benefits of computer algebra systems to understanding concepts and solving 
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problems. In particular, the procedure for plotting geodesics can be modified to show equations 
of motion of particles constrained to surfaces. This is done in Chapter 7 along with describing 
procedures relevant to the calculus of variations and optimal control. At the end of Chapters 
1, 2, 3, 5 and 7 there are sections devoted to explaining how Maple fits into the framework of 
Differential Geometry. I have tried to make these sections a rather informal tutorial as opposed 
to just laying out procedures. This is both good and bad for the reader. The good comes from the 
little tips about pitfalls and ways to avoid them; the bad comes from my personal predilections 
and the simple fact that I am not a Maple expert. What you will find in this text is the sort of 
Maple that anyone can do. Also, I happen to think that Maple is easier for students to learn than 
Mathematica and so I use it here. If you prefer Mathematica, then you can, without too much 
trouble I think, translate my procedures from Maple into Mathematica. 

In spite of the use of computer algebra systems here, this text is traditional in the sense of 
approaching the subject from the point of view of the 19th century. What is different about 
this book is that a conscious effort has been made to include material that I feel science and 
math majors should know. For example, although it is possible to find mechanistic descriptions 
of phenomena such as Clairaut’s relation or Jacobi’s theorem and geometric descriptions of 
mechanistic phenomena such as the precession of Foucault’s pendulum in advanced texts (see 
[Arn78] and [Mar92]), I believe they appear here for the first time in an undergraduate text. Also, 
even when dealing with mathematical matters alone, I have always tried to keep some application, 
whether mathematical or not, in mind. In fact, I think this helps to show the boundaries between 
physics (e.g., soap films) and mathematics (e.g., minimal surfaces). 

The book, as it now stands, is suitable for either a one-quarter or one-semester course in 
Differential Geometry as well as a full-year course. In the case of the latter, all chapters may be 
completed. In the case of the former, I would recommend choosing certain topics from Chap- 
ters 1-7 and then allowing students to do projects, say, involving other parts. For example, a 
good one-semester course can be obtained from Chapter 1, Chapter 2, Chapter 3 and the first 
“half? of Chapter 5. This carries students through the basic geometry of curves and surfaces 
while introducing various curvatures and applying virtually all of these ideas to study geodesics. 
My personal predilections would lead me to use Maple extensively to foster a certain geometric 
intuition. A second semester course could focus on the remainder of Chapter 5, Chapter 6 and 
Chapter 7 while saving Chapter 4 on minimal surfaces or Chapter 8 on higher dimensional 
geometry for projects. Students then will have seen Gauss-Bonnet, holonomy and a kind of 
recapitulation of geometry (together with a touch of mechanics) in terms of the Calculus of 
Variations. There are, of course, many alternative courses hidden within the book and I can only 
wish “good hunting” to all who search for them. 


Projects 


I think that doing projects offers students a chance to experience what it means to do research in 
mathematics. The mixture of abstract mathematics and its computer realization affords students 
the opportunity to conjecture and experiment much as they would do in the physical sciences. 
My students have done projects on subjects such as involutes and gear teeth design, re-creation 
of curves from curvature and torsion, Enneper’s surface and area minimization, geodesics on 
minimal surfaces and the Euler-Lagrange equations in relativity. In Appendix C, I give five 
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suggestions for student projects in differential geometry. (Certainly, experienced instructors will 
be able to point the way to other projects, but the ones suggested are mostly “self-contained” 
within the text.) Groups of three or four students working together on projects such as these can 
truly go beyond their individual abilities to gain a profound understanding of some aspect of 
geometry in terms of proof, calculation and computer visualization. 


Prerequisites 


Ihave taught differential geometry to students (e.g., mathematics, physics, engineering, chemistry, 
biology and philosophy majors) with as little background as a complete calculus sequence and 
a standard differential equations course. But this surely is a minimum! While a touch of linear 
algebra is used in the book, I think it can be covered concurrently as long as a student has 
seen matrices. Having said this, I think a student can best appreciate the interconnections among 
mathematical subjects inherent in differential geometry when the student has had the experience 
of one or two upper-level courses. 


Book Features 


The reader should note two things about the layout of the book. First, the exercises are integrated 
into the text. While this may make them somewhat harder to find, it also makes them an essential 
part of the text. The reader should at least read the exercises when going through a chapter — 
they are important. Also, I have used Exercisex to denote an exercise with a hint or solution in 
Appendix B. Secondly, I have chosen to number theorems, lemmas, examples, definitions and 
remarks in order as is usually done using LaTeX. To make it a bit easier to find specific examples, 
there is a list of examples (with titles and the pages they are on) in Appendix A. 


Elliptic Functions and Maple Note 


In recent years, I have become convinced of the utility of the elliptic functions in differential 
geometry and the calculus of variations, so I have included a simplified, straightforward introduc- 
tion to these here. The main applications of elliptic functions presented here are the derivation of 
explicit parametrizations for unduloids and for the Mylar balloon. Such explicit parametrizations 
allow for the determination of differential geometric invariants such as Gauss curvature as well 
as an analysis of geodesics. Of course, part of this analysis involves Maple. These applications of 
elliptic functions are distillations of joint work with Ivailo Mladenov and I want to acknowledge 
that here with thanks to him for his insights and diligence concerning this work. 

Unfortunately, in going from Maple 9 to Maple 10, Maple developers introduced an error (a 
misprint!) into the procedure for Elliptic E. In order to correct this, give the command 


> ‘evalf/Elliptic/E11_E‘ :=parse (StringTools:-Substitute 
(convert (eval (‘evalf/Elliptic/E11_E‘) ,string) ,"F_0","E_0")): 


Maple also seems to have changed its “simplify” command to its detriment. Therefore, there 
may be slight differences between the Maple output as displayed in the text and what Maple 10 
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gives, but these differences are usually minor. Nevertheless, to partially correct this discrepancy 
in simplification, define the command 


> mysimplify:=a->‘simplify/do‘ (subsindets(a,function, 
x->sqrt (x*2)),symbolic): 


and then apply it to any expression you want to simplify further. 


These corrections are due to Alec Mihailovs. The rest of the procedures in the book work fine in 
Maple 10. 


Thanks 


Since the publication of the First Edition, many people have sent me comments, suggestions and 
corrections. I have tried to take all of these into account in preparing the present MAA Edition, 
but sometimes this has proved to be impossible. One reason for this is that I want to keep the book 
at a level that is truly accessible to undergraduates. So, for me, some arguments simply can’t be 
made. On the other hand, I have learned a great deal from all of the comments sent to me and, in 
some sense, this is the real payment for writing the book. Therefore, I want to acknowledge a few 
people who went beyond the call of duty to give me often extensive commentary. These folks 
are (in alphabetical order!): David Amold, David Bao, Neil Bomberger, Allen Broughton, Jack 
Chen, Rob Clark, Gary Crum, Dan Drucker, Lisbeth Fajstrup, Davon Ferrara, Karsten Grosse- 
Brauckmann, Sigmundur Gudmundsson, Sue Halamek, Laszlo Illyes, Greg Lupton, Takashi 
Kimura, Carrie Kyser, Jaak Peetre, John Reinmann, Ted Shifrin and Peter Stiller. Thanks to all 
of you. 

Finally, the writing of this book would have been impossible without the help, advice and 
understanding of my wife Jan and daughter Kathy. 


John Oprea 
Cleveland, Ohio 2006 
j.oprea@csuohio.edu 


(Also, see the website www.csuohio.edu/math/oprea for Maple updates.) 


For Users of Previous Editions 


The Maple work found in the present MAA Edition once again focusses on actually doing 
interesting things with computers rather than simply drawing pictures. Nevertheless, there are 
many more pictures of interesting phenomena in this edition. The pictures have all been created 
by me with Maple. In fact, by examining the Maple sections at the ends of chapters, it is usually 
pretty clear how all pictures were created. The version of Maple used for the Second Edition 
was Maple 8. The version of Maple used for the MAA Edition is Maple 10. The Maple work 
in the First Edition needed extensive revision to work with Maple 8 because Maple developers 
changed the way certain commands work. However, going from Maple 8 to Maple 10 has been 
much easier and the reader familiar with the Second Edition will have no trouble with the MAA 
Edition. Everything works the same. Should newer versions of Maple cause problems for the 
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procedures in this book, look at my website for updates. One thing to pay attention to concerning 
this issue of Maple command changes is the following. Maple no longer supports the “linalg” 
package. Rather, Maple has moved to a package called “LinearAlgebra” and I have changed all 
Maple work in the book to reflect this. This should be stable for some time to come, no matter 
what new versions of Maple arise. Of course, the one thing that doesn’t change is the book’s 
focus on the solutions of differential equations as the heart of differential geometry. Because of 
this, Maple plays an even more important role through its “dsolve” command and its ability to 
solve differential equations explicitly and numerically. 


Maple 8 to 9 


Maple 9 appeared in Summer of 2003 and all commands and procedures originally written 
for Maple 8 work with one small exception. The following Maple 8 code defines a surface of 
revolution with functions g and h. 


> hi=t->h(t) ;g:=t->g(t) ; 
h:=h 
8 :=8 
> surfrev:=[h(u)*cos(v) ,h(u)*sin(v) ,g(w)]; 
surfrev := [h(u) cos(v), h(w) sin(v), g(u)] 


This works fine in Maple 8, but Maple 9 complains about defining g and h this way saying that 
there are too many levels of recursion in the formula for “surfrev”. The fix for Maple 9 is simple. 
Just don't define g and h at all! Go straight to the definition of “surfrev”. In the present MAA 
edition, the code has been modified to do just this, but if you are still using Maple 8, then use the 
code above. 


Note to Students 


Every student who takes a mathematics class wants to know what the real point of the course 
is. Often, courses proceed by going through a list of topics with accompanying results and 
proofs and, while the rationale for the ordering and presentation of topics may be apparent to the 
instructor, this is far from true for students. Books are really no different; authors get caught up 
in the “material” because they love their subject and want to show it off to students. So let’s take 
a moment now to say what the point of differential geometry is from the perspective of this text. 

Differential geometry is concerned with understanding shapes and their properties in terms 
of calculus. We do this in two main ways. We start by defining shapes using “formulas” called 
parametrizations and then we take derivatives and algebraically manipulate them to obtain new 
expressions that we show represent actual geometric entities. So, if we have geometry encoded 
in the algebra of parametrizations, then we can derive quantities telling us something about that 
geometry from calculus. The prime examples are the various curvatures which will be encountered 
in the book. Once we see how these special quantities arise from calculus, we can begin to turn 
the problem around by restricting the quantities in certain ways and asking what shapes have 
quantities satisfying these restrictions. For instance, once we know what curvature means, we 
can ask what plane curves have curvature functions that are constant functions. Since this is, ina 
sense, the reverse of simply calculating geometric quantities by differentiation, we should expect 
that “integration” arises here. More precisely, conditions we place on the geometric quantities 
give birth to differential equations whose solution sets “are” the shapes we are looking for. 

So differential geometry is intimately tied up with differential equations. But don’t get the idea 
that all of those crazy methods in a typical differential equations text are necessary to do basic 
differential geometry. Being able to handle separable differential equations and knowing a few 
tricks (which can be learned along the way) are usually sufficient. Even in cases where explicit 
solutions to the relevant differential equations don’t exist, numerical solutions can often produce 
a solution shape. The advent of computer algebra systems in the last decade makes this feasible 
even for non-experts in computer programming. 

In Chapter 1, we will treat the basic building blocks of all geometry, curves, and we will do 
exactly as we have suggested above. We will use calculus to develop a system of differential 
equations called the Frenet equations that determine a curve in three-dimensional space. We will 
use the computer algebra system Maple to numerically solve these equations and plot curves in 
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space. But with any computer program there are inputs, and these are the parameters involved in 
the Frenet equations; the curvature and torsion of a curve. So, we are saying that the curvature 
and torsion determine a curve in a well-defined sense. This is exactly the program outlined above. 
Of course, we will also see that, by putting restrictions on curvature and torsion, we can see what 
curves arise analytically as well. 

In Chapter 2, we take what we have learned about curves and apply it to study the geometry 
of surfaces in 3-space. The key definition is that of the shape operator because from it flows all 
of the rest of the types of curvatures we use to understand geometry. The shape operator is really 
just a way to take derivatives “in a tangential direction” and is related to the usual directional 
derivative in 3-space. What is interesting here is that the shape operator can be thought of as a 
matrix, and this allows us to actually define curvatures in terms of the linear algebraic invariants 
of the matrix. For instance, principal curvatures are simply the eigenvalues of the shape operator, 
mean curvature is the average of the eigenvalues (i.e., one half the trace of the matrix) and Gauss 
curvature is the product of the eigenvalues (i.e., the determinant of the matrix). Of course, the 
challenge now is twofold: first, show that the shape operator and its curvature offspring reflect 
our intuitive grasp of the geometry of surfaces and, secondly, show that these curvatures are 
actually computable. This leads to the next chapter. 

In Chapter 3, we show that curvatures are computable just in terms of derivatives of a para- 
metrization. This not only makes curvatures computable, but allows us to put certain restrictions 
on curvatures and produce analytic solutions. For instance, we can really see what surfaces arise 
when Gauss curvature is required to be constant on a compact surface or when mean curvature 
is required to be zero on a surface of revolution. An important byproduct of this quest for com- 
putability is a famous result of Gauss that shows that Gauss curvature can be calculated directly 
from the metric; that is, the functions which tell us how the surface distorts usual Euclidean 
distances. The reason this is important is that it gives us a definition of curvature that can be 
transported out of 3-space into a more abstract world of surfaces. This is the first step towards a 
more advanced differential geometry. 

Chapter 4 deals with minimal surfaces. These are surfaces with mean curvature equal to zero 
at each point. Our main theme shows up here when. we show that minimal surfaces (locally) 
satisfy a (partial) differential equation known as the minimal surface equation. Moreover, by 
putting appropriate restrictions on the surface’s defining function, we will see that it is possible to 
solve the minimal surface equation analytically. From a more geometric (as opposed to analytic) 
viewpoint, we focus here on basic computations and results, as well as the interpretations of 
soap films as minimal surfaces and soap bubbles as surfaces where the mean curvature is a 
constant function. The most important result along these lines is Alexandrov’s theorem, where it 
is shown that such a compact surface embedded in 3-space must be a sphere. The chapter also 
discusses harmonic functions and this leads to a more advanced approach to minimal surfaces, 
but from a more analytic point of view. In particular, we introduce complex variables as the 
natural parameters for a minimal surface. We don’t expect the reader to have any experience with 
complex variables (beyond knowing what a complex number is, say), so we review the relevant 
aspects of the subject. This approach produces a wealth of information about minimal surfaces, 
including an example where a minimal surface does not minimize surface area. 

In Chapter 5, we start to look at what different geometries actually tell us. A fundamental 
quality of a “geometry” is the type of path which gives the shortest distance between points. For 
instance, in the plane, the shortest distance between points is a straight line, but on a sphere this 
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is no longer the case. If we go from Cleveland to Paris, unless we are very good at tunnelling, we 
must take the great circle route to achieve distance minimization. Knowing that shortest length 
curves are great circles on a sphere gives us an intuitive understanding of the curvature and 
symmetry of the sphere. So this chapter deals with “shortest length curves” (i.e., geodesics) on a 
surface. In fact, we modify this a bit to derive certain differential equations called the geodesic 
equations whose solutions are geodesics on the surface. Again, while it is sometimes possible to 
obtain analytic expressions for geodesics, more often we solve the geodesic equations numerically 
and plot geodesics to discover the underlying geometry of the surfaces. The geodesic equations 
may also be transported to a more abstract situation, so we begin to see more general geometric 
effects here as well. 

Chapter 6 is the culmination of much of what has come before. For in this chapter, we see 
how curvature can affect even the most basic of geometric qualities, the sum of the angles 
in a triangle. The formalization of this effect, which is one of the most beautiful results in 
Mathematics, is known as the Gauss-Bonnet theorem. We present various applications of this 
theorem to show how “abstract” results can produce concrete geometric information. Also in 
this chapter, we introduce a notion known as holonomy that has profound effects in physics, 
ranging from classical to quantum mechanics. In particular, we present holonomy’s effect on the 
precession of Foucault’s pendulum, once again demonstrating the influence of curvature on the 
world in which we live. 

Chapter 7 presents what can fairly be said to be the prime philosophical underpinning of the 
relationship of geometry to Nature, the calculus of variations. Physical systems often take a 
configuration determined by the minimization of potential energy. For instance, a hanging rope 
takes the shape of a catenary for this reason. Generalizing this idea leads yet again to a differential 
equation, the Euler-Lagrange equation, whose solutions are candidates for minimizers of various 
functionals. In particular, Hamilton’s principle says that the motions of physical systems arise 
as solutions of the Euler-Lagrange equation associated to what is called the action integral. A 
special case of this gives geodesics and we once again see geometry arising from a differential 
equation (which itself is the reflection of a physical principle). 

In Chapter 8, we revisit virtually all of the earlier topics in the book, but from the viewpoint of 
manifolds, the higher-dimensional version of surfaces. This is necessarily a more abstract chapter 
because we cannot see beyond three dimensions, but for students who want to study physics or 
differential geometry, it is the stepping stone to more advanced work. Systems in Nature rarely 
depend on only two parameters, so understanding the geometry inherent in larger parameter 
phenomena is essential for their analysis. So in this chapter, we deal with minimal submanifolds, 
higher-dimensional geodesic equations and the Riemann, sectional, Ricci and scalar curvatures. 
Since these topics are the subjects of many volumes themselves, here we only hope to indicate 
their relation to the surface theory presented in the first seven chapters. 

So this is the book. The best advice for a student reading it is simply this: look for the right 
differential equations and then try to solve them, analytically or numerically, to discover the 
underlying geometry. Now let’s begin. 


The Geometry of Curves 


1.1. Introduction 


The world of Euclidean geometry is inhabited by lines and planes. If we wish to go beyond this 
flat world to a universe of curvature, we need to understand more general types of curves and 
surfaces. This chapter is devoted to exploring curves. Later we shall see that a basic understanding 
of the geometry of curves is essential to understanding the nature of surfaces as well. 

A curve in 3-space R? is a continuous mapping a: J > R? where / is some type of interval 
(e.g., (0, 1), (a, b), (a. b], (—00, a], [0, 1] etc.) on the real line R. Because the range of @ is R’, 
a’s output has three coordinates. We then write, fort € 1, a parametrization for a, 


a(t) = (a'(t), a(t), a(t) 


where the a’ are themselves functions a’: J > R. A useful way to think about curves is to 
consider r to be time and a(t) to be the path of a particle in space. We say @ is differentiable or 
smooth if each coordinate function a is differentiable as an ordinary real-valued function of R. 
In fact, later, in order to define curvature and torsion, we will need each a! to be at least 3-times 
differentiable. In practice, we will not find this condition to be much of a constraint. 
The velocity vector of @ at to is defined to be 
e 


where da’ /dt is the ordinary derivative and |,-,, denotes evaluation of the derivative at ¢ = fo. 
We shall also write 42" (1) for this evaluation when it is convenient. In order to interpret a’ 
geometrically, we introduce the 


Example 1.1.1 (First Example of a Curve: a Line in R’). 

We know that two points determine a line. In the plane R?, this leads to the usual slope-intercept 
equation which gives an algebraic description of the line. Unfortunately, in R? we don’t have a 
good notion of slope, so we can’t expect exactly the same type of algebraic description to work. 


1 
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Figure 1.1. A parametrized line 


For two points, p and q, the line / joining them may be described as follows. To attain the line, 
add the vector p. To travel along the line, use the direction vector q — p since this is the direction 
from p to q. A parameter t tells exactly how far along q — p to go. Putting these steps together 
produces a curve which is a parametrized line (see Figure 1.1) in R*, 
a(t) =p + t(q—p). 

For instance, if p = (1, 2, 3) and q = (—1, 4, —7), then q — p = (—2, 2, —10), so the line through 
p and q is given by the curve 

a(t) = (1, 2,3) + ¢(—2, 2, —10) 

= (1 —2t,2 +21, 3 — 10). 


Exercise# 1.1.2. What is the parametrization of the line through (—1, 0, 5) and (3, —1, —2)? 


Exercise 1.1.3. In R*, what is the parametrization of the line through (—1,6,5,0) and 
(0, 1, —3, 9)? 


So we see that, given a line a(t) = p + t(q — p), a(t) = q — p. Since this direction vector q — p 
is then the velocity vector, we often write 


a(t)=pt+tv. 


Now that we understand lines, we can ask what the picture of the velocity vector is for any curve 
a(t)? The definitions give 
. 


1 3 
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Figure 1.2. A curve with tangent vector 


This looks much like our usual “slope” definition of the derivative. Note that the approximating 
vectors inside the limit always point in the direction of increasing t. For t < fg, this follows since 
t — fo is negative. Now, t — fo, so the vector moves closer and closer to being a tangent vector 
(see Figure 1.2) to the curve at a(fo). Hence, the velocity vector a’(tp) to @ at fo is a tangent vector 
to a at a(fo). Notice that we obtain a vector with a precise length, not a line. Remember that a 
vector v = (v!, v*, v’) € R?, has a length given by the Pythagorean theorem, 


Iv] = J(u)? + (v2)? + (032. 


Exercises 1.1.4. Show that |v] is the distance of (v!, v7, v?) from (0, 0, 0) by two applications 
of the Pythagorean theorem. 


A curve a(t) is regular if a'(t) #4 0 for all t € J. In general, if a curve is not regular, then we 
split it into regular pieces and consider each of these separately. The assumption of regularity 
ensures that we keep going in the same direction along the curve. This will be important when 
we consider arclength shortly. 


Example 1.1.5 (The Cusp a(t) = (t?, 1°). 
Note that x = #7, y= 1°, so y = +x°/* with a(t) = (21, 317) and a’(0) = (0, 0). Thus @ is not 
regular, but the pieces above and below the x-axis are regular. 


In our situation, |o’(t)| = (da! /dt)* + (da?/dt)* + (da3/dt) is simply the speed of w. Again, 
thinking of @ as the path of a particle and ¢ as time, we see that the length of the velocity vector 
is precisely the speed of the particle at the given time. For a line, a(t) = p+ tv, the speed is 
simply |v|, the length of the direction vector. Note that we have taken lines to have constant 
speed v. We will see later that every regular curve can be re-parametrized to have constant speed, 
so this is no real restriction. However, a given parametrization may disguise the fact that the 
underlying curve is a line until re-parametrized to have constant speed. We will keep our con- 
vention that lines are parametrized with constant speed because we want to characterize them by 
Proposition 1.1.7. 
If x(t) = (a! (t), w(t), o3(r)) is a curve in R°, then its acceleration vector is given by 


w= (4 S30), a (0), £00). 
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Exercises 1.1.6. Newton asked the question, what is the curve which, when revolved about an 
axis, gives a surface offering the least resistance to motion through a “rare” fluid (e.g., air)? 
Newton’s answer is the following curve (see Chapter 7). 


= All 3 A ] 2 34 7 
wn. on = (3 [e+e |: [n() +0 +i|-2), 


where A is a positive constant and x > 2A. Graph this curve and compute its velocity and 
acceleration vectors. What angle does the curve make with the x-axis at the intersection point 
(2A, 0)? 


We now come to the first (and simplest nontrivial) instance of calculus imposing a constraint 
on the geometry of a curve. In a real sense, this is what the differential geometry of curves is all 
about and we will see many more examples of it later. 


Proposition 1.1.7. The curve a is a (constant speed) straight line if and only if a” = 0. 


Proof. If a(t) = p + tv isa line, then a@’(t) = v (which is a constant vector), so a(t) = 0. 

If v(t) = 0 for all t, then d*a‘(t)/dt? = 0, for each coordinate function a(t). But a zero 2nd 
derivative simply means that da‘(t)/dt = v' is a constant. We may integrate with respect to f to 
obtain a(t) = p' + v't where p’ is a constant of integration. Then 


a(t) =(p' +1tv!, p? +10’, p +tv?) = pt+tv 


with p=(p!, p’, p’) and v=(v!, v*, v*). Hence the curve @ may be parametrized as 
a line. O 


This easy result indicates how we will use calculus to detect geometric properties. Just to see 
how far we can get using these elementary ideas, let’s consider the following question. What is 
the shortest distance between two points p and q in R*? We have been taught since we were 
children that the answer is a line, but now we can see why our intuition is correct. Calculus tells 
us that distance is simply the integral of speed. Therefore, the integral 


b 
La) = [ |a’(t)| dt 


calculates the arclength (i.e., the distance travelled) of a regular curve a: [a, b] > R? from a(a) 
to a(b). In order to see this, consider a small piece (1.e., “an infinitesimal piece”) of arclength 
approximated as in Figure !.3. The Pythagorean theorem then says that the hypotenuse of the right 
triangle with sides dx and dy has length] = ,/dx? + dy*. Suppose both x and y are parametrized 
by t. Then a(t) = (x(t), y(t)) and dx? =(dx/dty dt? and dy* =(dy/dty dt*. Finally, 
we have 
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dx 


Figure 1.3. An infinitesimal piece of curve 


expressing the relation, distance equals rate times time. Now we add up all the small pieces of 
the curve by integration to get, 


dx\? dy 
w= PVG) +3 
b 
af Ja’(t)| dt. 


The three-dimensional version of this intuitive description simply uses our earlier exer- 
cise on the Pythagorean theorem in three dimensions. Now, the length of the vector v, |vj, 
may be written ./v-v, where - denotes the dot product of vectors. Recall that, in general, 
vew=v!w!+v2w? +03w, for v=(v!, v?, v’) and w= (w!, w’, w?). We also have the 
following 


Proposition 1.1.8. The dot product v -w may be calculated by v - w = |v||w|cos0, where 0 is 
the angle between the vectors v and w. 


Proof. From Figure 1.4, we see that the usual properties of vectors give u = v — w. Hence 


jul? =u-u=(v—w)-(v—w) 
v-v—-2v-wt+w-w 


= |v? + |wi? — 2v-w. 


Il 


fe 


Figure 1.4. 


6 1. The Geometry of Curves 


But the Law of Cosines for this triangle says |u|? = [v|* + [w|? — 2|v| |w| cos@. Equating the 
two right-hand sides and eliminating like terms produces 


v- w= |v| |wicosé. 


Corollary 1.1.9 (Schwarz’s Inequality). The dot product obeys the inequality, 


Iv-wl < |v] |wI. 
Proof. Simply note that | cos @| < 1. C 


We note here that, with respect to the derivative, the dot product behaves just as ordinary 
multiplication — namely, the Leibniz (or product) rule holds. In particular, 
Proposition 1.1.10. [fa(t) and B(t) are two curves in R°, then 


dap) _ da, a8 
dt dt gue dt° 


Proof. We use the ordinary product rule on the component functions, 


d@-B)_ diin 2 p2 3 23 
Fi =r +a‘ p* +a° Bp) 


O 


When the parameter f is understood, we will write the product rule as (a - 8)’ = a’- B+a- B’. 
We are now able to answer the question of which (smooth) path between two points gives the 
shortest distance. 


Theorem 1.1.11. Jn R?, a line is the curve of least arclength between two points. 
Proof, Consider two points p, q € R°. The line between them may be parametrized by /(t) = 


p ++#(q —p), where q — p is the vector in the direction from p to q. Then /’(t) = q — p and 
\’(t)| = |q — p], a constant. Therefore, 


1 { 
LO= / W(Oldt = Iq— pI i d= \q—p! 
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P 


Figure 1.5. Line and comparison curve 


and the length of the line segment (or direction vector) from p to q is the distance from p 
to q (as of course we expected). Now consider another curve a which joins p and q (see 
Figure 1.5). 

We want to show that L(a@) > L(/) and, since q@ is arbitrary, this will say that the straight line 
minimizes distance. Now, why should @ be longer than /? One intuitive explanation is to say that 
a@ starts off in the wrong direction. That is, a’(a) is not “pointing toward” q. How can we measure 
this deviation? The angle between the unit vector in the direction of q — p, u = (q — p)/lq — pl. 
and a’(a), the tangent vector of a at p, may be determined using the dot product @’(a) - u. The 
total deviation may be added up by integration to give us an idea of why L(a) > L(/) should 
hold. Precisely, we compute f : a’(t)- udt in two ways to obtain the inequality. Now, we have 
(a(t)- uy =a’(t)-u+a(t)-u’ =a’(t)- u since wu is a constant vector. Also, the Fundamental 
Theorem of Calculus gives i df/dtdt = f(b) — f(a), so 


b b 
[e@-nar= [ e@)-w'dt=a(b)-w-at@)-u 


=q:u-—p-u since a(a) =p, a(b)=q 
=(q—p)-u 
—p)-(q—p) 
_ (4—P)-(@—P (1) 
lq— Pl 
_ la-pl 
lq—Pl 
=lq-pl 
= Lil), 
the straight line distance from p to q. We also have, 
b b 
i a’(t)-udt < / je'(t)| - |u| dt by the Schwarz inequality 
b 
= ja’(t)| dt since |u| = 1 (2) 


= L(a). 
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Figure 1.6. Circle of radius r 


PE 
By, 


Combining (1) and (2), we have L(/) < L(@). Note also that a’(t) -u = |a’(t)| - |u| only when 
cos@ = 1, or 6 = 0. That is, a’(t) must be parallel to q — p for all r. In this case q@ is the line 
from p to q. Therefore, we have the strict inequality L(/) < L(q@) unless @ is a line. O 


So, using the first notions of calculus, we have seen how to identify lines in R? and to determine 
exactly why lines give the shortest distance between two points. Of course, R? is governed by 
Euclidean Geometry. One of the main goals of differential geometry is to develop analogous 
techniques for curved geometries as well. 

While curves described parametrically may be defined at the whim of the definer, in fact they 
often arise from geometric or physical considerations. Several of the examples below illustrate 
this. 


Example 1.1.12 (The Circle of Radius r Centered at (0, 0)). 

The circle is one of the most useful and familiar curves in geometry. Of course, the question 
is, how can such a curve be parametrized? The definitions of sin@ and cos@ as the vertical 
and horizontal sides of the right triangle with angle 6 formed by dropping a vertical line from 
P to the horizontal axis give us a way of assigning coordinates to the point P when r = 1. 
Namely, the point P has coordinates (cos @, sin@) (see Figure 1.6). For a circle of radius r, by 
the fundamental property of similar triangles, the vertical and horizontal sides of the triangle 
must be r sin@ and rcos@ respectively. Hence, the coordinates of the point on the circle are 
(r cos@, rsin@). Therefore, the circle may be parametrized by the angle 6. Again thinking of the 
circle as the path of a particle, we write 


a(t) =(rcost, rsint), for0 <t < 27. 


There are two important observations to remember. First, we have a’(t) = (—r sint, r cost), so 
a(t)-a@'(t) = 0 for all r. In other words, the tangent vector is perpendicular to the radius. Second, 
a"(t) =(—rcost, —rsint) = —a(t), so the acceleration vector points toward the center of the 
circle. Finally, we can compute the arclength (i.e., circumference) of the circle, 


2n 2n 
L@)= i jo'(t)|dt = Vr? sin? t + r2 cos? t dt 
0 0 
2n 2n 
= ry sin? t + cos? t dt = rdt 
0 0 


=rt\|." =20r. 
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Figure 1.7. A cycloid 


Exercise 1.1.13. Suppose a circle of radius a sits on the x-axis making contact at (0, 0). Let the 
circle roll along the positive x-axis. Show that the path a followed by the point originally in 
contact with the x-axis is given by 


a(t) = (a(t —sint), a(1 —cost)) 


where f is the angle formed by the (new) point of contact with the axis, the center and the original 
point of contact. This curve is called a cycloid (see Figure 1.7) and we will meet it again in 
Chapter 7. Hints: Recall that s = at where s is arclength. Draw a picture after the circle has 
rolled s units. Where is the old point of contact? 


Exercise 1.1.14. Consider a cycloid of the form 
(x(t), y(t)) = (A + a(t — sint), B —a(1 —cosf)). 


Graph this cycloid to see that it is an inverted form of the one found in the previous exercise. 
Suppose a unit mass particle starts at rest at a point (x, y) on the cycloid corresponding to an 
angle f in the parametrization above. Under the influence of gravity (and assuming no friction), 
show that, no matter what initial 7 is chosen, it always takes a time of 


a : ess 
T= /-7, where g is the gravitational constant 
& 


for the particle to slide down to the bottom of the cycloid (1.e., = 2). This property was used by C. 
Huygens to make clocks without pendula (which he hoped would allow for accurate timekeeping 
on ships at sea and, thereby, improve navigation). He named the cycloid the tautochrone since 
the latter means “same time”. We will revisit the cycloid in Chapter 7 where we will see it has 
yet another intriguing property and name. Hints: potential energy is turned into kinetic energy by 
v?/2 = g(¥ — y). Time equals distance divided by speed, so 


rege [ee 
= >s> SS _ Ax 
28 Ji y-y 
® 1— t 
= Fal earn EeNe P 
gJi cost — cost 


a > _ | 2 cos . 
= _/— {| —2arcsin | ————*+ 
g v¥1+cost 


rs 
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Figure 1.8. An astroid 


using y’ = (dy/dt)/(dx /dt) = —sint/(1 — cost), ¥ = B —a(1 —cosf), y = B — a(1 — cost) 
and simplifying. Verify the final step by differentiation and compute T. Now see Section 1.7 for 
a Maple approach. 


Example 1.1.15 (The Astroid). 

Consider the astroid a(t) = (acos? t, asin’ t) for 0 < t < 2m (see Figure 1.8). The definition of 
the astroid (which was discovered by people searching for the best form of gear teeth) is very 
similar to that of the cycloid. For the astroid, however, a circle is rolled, not on a line, but inside 
another circle. More precisely, let a circle of radius a/4 roll inside a large circle of radius a 
(centered at (0, 0) say). For concreteness, suppose we start the little circle at (a, 0) and follow the 
path of the point originally in contact with (a, 0) as the circle rolls up. Roll the circle a little bit 
and notice that a piece of arclength s is used up. The key to understanding this situation is the fact 
that, for t, the angle from the center of the large circle to the new contact point and @, the angle 
through which the small circle has rolled (measured from the point of contact), we have s = at 
and s = a@/4. Hence, 6 = 4. 

Now we can do two things to parametrize the path of the rolling point. First, we can parametrize 
the path with respect to the center of the little circle. This is easy because we are just moving an 
angle @ around a circle. There is a small problem because we measure 6 from the point of contact, 
but this is handled easily as in Figure 1.9. Note that we make real use of the relation @ = 4t here. 
The coordinates with respect to the center then become x = (a/4) cos 3t and y = (a/4) sin 3t. 


A 


/\ 


Figure 1.9. 
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Figure 1.10. The Witch of Agnesi 


Secondly, the center of the little circle always remains on the circle of radius 3a/4 centered at 
(0, 0), so that the center has moved to ((3a/4) cost, (3a/4) sint). Hence, with respect to the 
origin (0, 0), the rolling point has moved to 


3 3 
a(t) = (+ cost + ; cos 3f, > sint — 5 sin3r). 


Exercise 1.1.16. Verify the statements above and show that the formula for the astroid may be 
reduced to 


a(t) = (a cos*t, a sin’ r) 


with implicit form x77 + y?? = a/, 


Exercise 1.1.17. From the origin draw a line through any point P of the circle of radius a centered 
at (0, a). Find the intersection Q of this line with the horizontal line y = 2a. Drop a vertical line 
from Q and intersect it with a horizontal line passing through P. This intersection is a point on 
the curve known as the witch of Agnesi.! See Figure 1.10. Let t denote the angle between the 
vertical axis and the line through (0, 0), P and Q. Show that the witch W(r) is given in terms of 
t by 


W(t) = (2a tant, 2a cos*r). 


Example 1.1.18 (The Helix). 

Consider the parametrization a(t) = (acost, asint, bt), for 0 <t < oo. Notice that the the 
first two coordinates provide circular motion while the third coordinate lifts the curve out of the 
plane. The resulting helix is then seen in Figure 1.11. We calculate a’(t) = (—a sint, acost, b) 
and a”(t) = (—a cost, —asinr, 0). Note that a”(t) points toward the z-axis. 


Example 1.1.19 (The Suspension Bridge). 
Consider the diagram of a suspension bridge (Figure 1.12) where the cable supports a uniformly 
distributed load. This means that, over any interval [a, b], the weight supported is given by 


'The ‘witch’ was named by Maria Agnesi in the middle 1700s —— apparently as the result of a mistranslation! The 
curve had been studied previously by Grandi who had used the Italian word “versorio”, meaning “free to move in any 
direction”, in connection with it. Agnesi mistakenly thought Grandi had used the Italian “versiera”, which means “Devil's 
wife” or “‘witch” and so Agnesi’s witch got “her” name [Yat74]. 
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Figure 1.11. A helix 
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Figure 1.12. Diagram of a suspension bridge 
W =c(b — a), where c is constant. Newton’s Law F = ma applies to the vertical and horizontal 
components of tension in the cable T to give force equations 

T sin@ =cx and T cos@ = To 


where x and 6 are as depicted in Figure 1.12 and 7p is the constant horizontal force of tension 
pulling on the length of cable from 0 to x. 


Exercise 1.1.20. Using dy/dx = tan 9, solve these equations to get 


Oe 233 
== d. 
y 3T* + 


Thus, the cable of a suspension bridge hangs as a parabola. 


Example 1.1.21 (The Catenary). 

Now suppose a cable hangs freely only supporting its own weight (Figure 1.13). What is the 
curve it follows? The key difference between the preceding example and this is that the weight is 
not uniformly distributed horizontally, but, rather, is uniformly distributed along the length of the 
cable. Therefore, we must take arclength into account as opposed to the simpler distance along 
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Figure 1.13. Diagram of a hanging cable 


the axis. Let W denote the (weight) density of the cable and let s = s(x) denote the arclength of 
the cable from x = 0. 


Exercise 1.1.22. From the diagram, Newton’s Law F = ma, dy/dx = tan@ and the arclength 
derivative ds/dx = \/1 + (y’)’, derive the equations 


dy W dy W dy? 
ee di See ali. 
eo SE. Bae aay 


Now solve the second equation to get 


y =C cosh (=) + D. 


Hints: to solve the differential equation, let z = dy/dx. The equation becomes separable 


dz WA 
=—dx 
V1l+2 To 


and the left-hand side may be integrated by substituting in z = sinhw and recalling cosh? u — 
sinh? u = 1. 


A curve of the form y = c cosh(x/c) is called a catenary from the Latin for “chain” (i.e., 
“catena”). We will meet this curve again and again in the rest of the book. 


Example 1.1.23 (The Pursuit Curve). 

Suppose an enemy plane begins at (0, 0) and travels up the y-axis at constant speed v,. A missile 
is fired at (a,0) where a < 0 with constant speed v,, and the missile has a heat sensor which 
always directs it toward the plane. Show that the pursuit curve (see Figure 1.14) which the missile 
follows is given implicitly by the differential-integral equation 


v x 
yory'+ 2 | Vlty?dx. 
Vm a 
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-1 0.8 -0.6 0.4 -0.2 


Figure 1.14. A pursuit curve 


Differentiate this expression to get a separable differential equation. Integrate to get the closed 
form expression for the pursuit curve 


ye # 
S0hG) es = 
rn aa 


Hints: (1) Show that y =x y’+ vpt, where ¢ is the time from firing. Express ¢ in terms of 
arclength by looking at the missile’s D = RT equation. (2) Use y’(a) = 0 to determine a constant 
of integration. 


Example 1.1.24 (The Mystery Curve). 


Consider the curve a(t) = F cost, sint, F cost) forO <t < 27. 


Exercises 1.1.25. Identify this curve. Find e’(t), a”(t), and L(q@). Hints: what kind of curve is 
this? Where does it lie in R?? 


Exercises 1.1.26. Parametrize a circle which is centered at the point (a, b). 


Exercise 1.1.27. Parametrize the ellipse 5 ae y=. 


a Be 


1.2 Arclength Parametrization 


So far, we have thought of the parameter ¢ as being time. More often it is convenient to let the 
parameter express how far along the curve we are. That is, when we now write a(s), the parameter 
s is exactly the distance we have travelled along the curve. This is called parametrization by 
arclength. As we will see, arclength parametrization simplifies the differential geometry of curves 
considerably. Therefore, it would be nice to know that every curve may be parametrized in this 
way. This is, in fact, the case, although it is only “in principle” that this process works in general. 
By this we mean to say that, while any regular curve may be parametrized by arclength, it is quite 
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rare to obtain an explicit formula for the arclength parametrization. With this in mind, we will 
now first consider the notion of parametrization a little more carefully, then prove that every curve 
has an arclength parametrization. In the next section we will use the arclength parametrization to 
derive the Frenet Formulas, the formulas which describe the geometry of curves. 

Given a curve a: J] > R? with parameter t, we may reparametrize the curve by mapping 
another interval onto / and using the composition as a “new” curve (which, of course, has the 
same point-set image in R>). Precisely, let h: J > ] be a map of the interval J onto /. Then a 
reparametrization of a is given by 


B=ach:J>R, = Br) =a(A(r)). 


The curves f and @ pass through the same points in R?, but they reach any one of these points in 
different “times” (7 versus ft). The velocity vectors of a and £ are altered by h. 


Lemma 1.2.1. 6’(r) = a'((r))- “(r). 


Proof. By definition, B(r) = (@'(A(r)), a?(h(r)), o3(h(r)). The derivative of the coordinate 
function B' is, 


dBi ) dai(h(r)) da’ dh 
—(r) = ————- = —_ — 
dr dr dh dr 
by the chain rule. But da‘/dh is exactly da'/dt where h(r)=t€J/, so we obtain for 


t=h(r), 


Br) da' dh da?dh da dh oe! 
= (——, —-—, —— ] =a'(t)—. 
dt dr’ dt dr’ dt dr dr 

im 
Exercises 1.2.2. Recall that the arclength of a curve a: [a,b] > R? is given by L(a) = 
fle(ldt. Let B(r): [c,d] > IR? be a reparametrization of a defined by taking a map 
h: [c, d] > [a, b] with A(c) = a, h(d) = b and h’(r) = 0 for all r € [c, d]. Show that the arc- 
length does not change under this type of reparametrization. The same holds for the case h’(r) < 0. 


Theorem 1.2.3. [fa is a regular curve, then a may be reparametrized to have unit speed. 


Proof. Define the “arclength function” to be 


t= f lo'(u)| du 


(where u is just a “dummy” variable). Since @ is regular, the Fundamental Theorem of Calculus 
implies 


of ihe 0 
7 = Wl > 0. 
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By The Mean Value Theorem, s is strictly increasing on / and so is one-to-one. Therefore, s has 
an inverse function which we denote by f(s) and the respective derivatives are inversely related, 


1 


Br(s)) ~ 


Let B(s) = a(t(s)). Then, using Lemma 1.2.1, B’(s) = e'(t(s)) 4(s). But now we have, 


GF se 
i 


d 
|B'(s)| = [a’(t(s))| IF 
Ss 
= ds dt 
= 7 ts) ae 
ds 1 
= —i(t —_——_- 
ai) Eee 
=) 


0 


Without loss of generality, suppose 6 is defined on the interval [0, 1]. Consider the arclength of 
the re-parametrization £ out to a certain parameter value so, 


L(so) = [ [B'(s)|ds = i lds = So. 
0 0 


This is, of course, exactly what we mean by “arclength parametrization”. Hence, a curve is 
parametrized by arclength exactly when it has unit speed. 


Example 1.2.4 (Helix Re-Parametrization). 

Consider the helix a(t) = (a cost, a sint, bt) witha’(t) = (—asint, acost, b). We easily calcu- 
late that |a’(t)| = Va? + b? = c. Then s(t) = A cdu = ct and the inverse function is f(s) = ts, 
So an arclength re-parametrization is given by 


B(s) = a(s/c) = (acos ©, asin = =) : 
c é 7é 


Note that |[8’(s)| = 1. This example generalizes to the case of any a with constant speed. Such 
a curve may be parametrized by arclength explicitly. For general curves, however, the integral 
defining s may be impossible to compute in closed form. 


Exerciset 1.2.5. Re-parametrize the circle of radius r to have unit speed. 
Exercisee 1.2.6. Can you explicitly re-parametrize the ellipse to have unit speed? 


Exercise 1.2.7. Let a(t) be a curve not necessarily parametrized by arclength. Suppose a string 
is wound around the curve @ and is attached to the curve at some point where the end of the string 
falls. Let s be the arclength function along a from some fixed point on a. If the string is kept taut 
and unwound from the curve starting at the beginning of the string, then the free end always lies 
a distance s(t) away from the point a(t) in the direction —a’(t). Thus, show that the curve swept 
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out by the free end is given by 


a’(t) 
la’(t)| 


This curve is an involute of a. (Note that different starting points give different involutes.) 
Calculate and graph the involute of the unit circle a(t) = (cost, sint, 0) (see Figure 1.27). Note 
that the involute of a unit speed curve a(s) is given by Z(s) = a(s) — s a@’(s). For an application 
of circle involutes to gear teeth design see [SU80]. 


L(t) = a(t) — s(t) 


Exercise 1.2.8. Show that the involute of a helix is a plane curve. 


1.3 Frenet Formulas 


In this section we shall assume our curves are parametrized by arclength. Consider this problem. 
Suppose a robot arm with three fingers at its end is to move and grasp an object. How should a 
path be programmed for the “hand” so that the fingers end up in the proper “grasping” position? 
Somehow, a curve must be parametrized to not only direct the robot hand to the proper position, 
but to align the fingers correctly as well. We will see that the geometry of a curve (i.e., its twisting 
and turning) may be completely described by understanding how three “fingers” attached to the 
curve vary as we move along the curve. The variation is described by what are now called the 
Frenet Formulas. What are these “fingers” which are attached to the curve? 

The first finger is the unit tangent vector to the curve. Let B: J > R? be a unit speed curve 
(e., |B’| = 1). Denote the unit tangent vector along B by T = B’. Because |T| = 1 everywhere 
along f, the derivative T’ measures only the rate of change of T’s direction. Hence, T’ is a good 
choice to detect some of B’s geometry. 


Proposition 1.3.1 (Trick to Remember). |T| = /T -T = 1, so T -T = 1 as well. The product 
rule then gives 


O=(7-TY =T'-T4+T-T' =2T-T". 
Hence T - T' = 0 and, therefore, T’ is perpendicular to T. 


See Figure 1.15 and note that, while T and T’ are perpendicular, perspective influences the 
picture. We say that 7’ is normal to B. Because there is no reason why the length of T’ should 
be 1, however, we define the curvature function of B to be 


x(s) = |T'(s)I. 


Of course « > 0 and x increases as 8 turns more sharply. If « = 0 everywhere, then, as we will 
see in Theorem 1.3.20 below, we know everything about the curve 6 already. Therefore, for the 
discussion below, assume x > 0. 


Exercise* 1.3.2. Suppose § is a unit speed curve in the xy-plane. Show that, up to sign, the 
curvature of 8 is given by ra where @ is the angle between T and e; = (1, 0). 
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Figure 1.15. Curve with T and T’ 


Since we would like our fundamental quantities to be unit vectors, we define the principal 
normal vector along B to be N = LT’. For this to make sense we need x # 0 along the curve, 
so we take x > 0. Then, by the definition of curvature, |N| = 1. We need to have a third vector 
along 8 as part of our 3-fingered orientation in 3-dimensional space and this third vector should 
be perpendicular to both T and N (just as T and N are to each other). Before we make the 
definition, we recall some facts about another vector operation, the cross product. 

Unlike the dot product of two vectors (which produces a real number), the cross product of 
two vectors produces another vector. Let v = (v!, v2, v>) and w = (w!, w?, w?). Then we define 
the cross product of v and w to be (in coordinates), 


2443 B23 ola 3 


vxwe (vw — vw’, vw! — vw, vw? 


- vw!), 


An easy way to remember this formula is to compute the “determinant” 


where e,, e2, e3 denote the unit vectors (1,0, 0), (0, 1, 0) and (0, 0, 1) respectively. Note that 
the traditional notation of vector calculus denotes these vectors by i, j and k. 


Exercise# 1.3.3. Compute the determinant and verify that it gives the formula for v x w. 
Exercise# 1.3.4. What happens if the 2nd and 3rd rows of the determinant are interchanged? 


Exercises 1.3.5. Verify (by tedious computation or Maple) the following formula known as 
Lagrange’s Identity: 


lv x wi? = [vi7|wi? — (v- w)’. 


Exercise# 1.3.6. Use the previous exercise to show that 
|v x w| = |v| [w| sin@ 


where @ is the angle between v and w. Note then that v x w = 0 if and only if v and ware parallel 
(.e., W = av fora € R). 
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Vv 


Figure 1.16. A parallelogram 


Exercise 1.3.7. Show that v-(v x w) = 0 = w- (v x w). Therefore, the cross product of two 
vectors is a vector perpendicular to both of them. 


Exercises 1.3.8. Show that |v x w| is the area of the parallelogram spanned by v and w (as 
shown in Figure 1.16). 


Now, knowing what we do about the cross product (and always assuming « > 0), we may define 
the binormal along B to be B = T x N. Note that |B| = |T| |N[sin(90°) = 1-1-l=1,soB 
is already a unit vector. Also, as we have seen above, B- T = 0 = B-N. The set {T, N, B} is 
called the Frenet Frame along B (see Figure 1.17). 

The measurement of how T, N, and B vary as we move along f will tell us how £ itself 
twists and tums through space. The variation of T, N, and B will be determined by calculating 
the derivatives T’, N’, B’. We already know T’ = « N by definition of N, so the curvature « 
describes T’s variation in direction. We must still find B’ and N’. Now, T, N and B are three 
mutually perpendicular unit (i.e., orthonormal) vectors in the 3-dimensional space R?, so any 
vector in R? is some linear combination of them. In particular, B’ = aT +bN +c B.Ifwecan 
identify a, b and c, then we will know B’. To do this we use what we know about 7, N and B. 
First, 


T-B'=aT-T+bT-N+cT-B 
=a-1+b-0+c-0 


=a. 


Figure 1.17. Frenet frame along a curve 
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Similarly, N - B’ = b and B - B’ = c. Therefore, 
B’=(T-B')T+(N-B)N+(B-B)YB. 


Now let’s identify T - B’. We know T - B = 0, so thatO =(7T- BY =T'-B+4+T- B’ by the 
product rule. Then, using N - B = 0, we obtain 


T-B'=-T'-B 


lI 
| 
x 
= 
bw 


We can also identify B - B’. We know B-B=1,so0=(B-B) =B'-B+B-B'=2B.- B’. 
Thus, B - B’ = 0. Now we are left with a single possibly nonzero coefficient in the expression 
for B’. Since we cannot immediately identify it in terms of known quantities, we give it a name. 
Define t = —N - B' to be the torsion of the curve B. By what we have said above, 


B'=—-tN. 
Now we find N’. Just as for B’, we have 
N’=(T-N’)T+(N-NYN+(B-N’)B. 


The same types of calculations give T-N =0,so0=T’-N+T-N’andT’=xNsoT-N’= 
—«kN-.N=-—x«x.Also,N-N =1,soN-N’=OandB-N =0,soB’-N+B-N’ =0. Hence, 
B.N'=—B'.N=-—N.-B’ = t by definition. Thus, 


No =-xkT+ TB. 


Theorem 1.3.9 (The Frenet Formulas). For a unit speed curve with x > 0, the derivatives of the 
Frenet frame are given as follows. 


q 
li 


KN 


= 
II 

| 
= 
J 
+ 
a 
& 


Example 1.3.10 (Circle Curvature and Torsion). 

Consider a circle a(t) = (r cos £,r sin £, 0). The derivative of @ is a'(t) = (1/r)(—r sin(t/r), 
rcos(t/r), 0) = (—sin(t/r), cos(t/r), 0) with |a’(t)| = 1, so w has unit speed and T =a’. 
Now T’ =a” = (-(1/r)cos(t/r), —(1/r) sin(t/r), 0) = —(1/r)(cos(t/r), sin(t/r), 0) where 
(—cos(t/r), — sin(t/r), 0) isa unit vector and thus equal to N. Now, T’ = +(1/r)N,sox = L/r. 
That is, for a circle of radius r, the curvature is constant and is equal to 1/r. This makes sense 
intuitively since, as r increases, the circle becomes less curved. The limit « = (1/r) > 0 reflects 
this. Now, we wish to understand the torsion of a circle. 


; 1. ft 1 t 
N= (- sin() ‘ -*c0s(*), 0) 
r r r r 
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But the Frenet Formulas say N’ = —« T + t B, so we must have t = 0. Therefore, the circle has 
zero torsion. We shall see a general reason for this fact shortly. 


Exercises 1.3.11. The Frenet Formulas help us to identify curves even when they are given in 
rather complicated parametrized forms. The curve below will also appear in Exercise 1.5.4. Let 


B(s) = (ee | ae 4) for-l<s <1. 


(1) Show that 6 has unit speed. 


(2) Show that « = 1/,/8(1 — s2). 


(3) Show that N = (4), VAM) 0) and B=T x N = (>4#, Ms) 


(4) Show that t = x. 


Exercise 1.3.12. If a rigid body moves along a curve a(s) (which we suppose is unit speed), 
then the motion of the body consists of translation along @ and rotation about a. The rotation 
is determined by an angular velocity vector w which satisfies T’ = w x T, N’ = @ x N and 
B' =a x B. The vector o is called the Darboux vector. Show that w, in terms of T, N and B, 
is given by w = tT +x B. Hint: writew = aT +bN +c B and take cross products with T, N 
and B to determine a, b and c. 


Exercise 1.3.13. Show that T’ x T” = «2m where w is the Darboux vector. 


Exercise 1.3.14. Instead of taking the Frenet frame along a (unit speed) curve a: [a, b] > R’, 
we can define a frame {T, U, V} by taking T to be the tangent vector of @ as usual and letting U 
be any unit vector field along a with T - U = 0. That is, U: [a, b] > R? associates a unit vector 
U(t) to each t € [a, b] which is perpendicular to T(t). Define V = T x U. Show that the natural 
equations (i.e., “Frenet Formulas’) for this frame are 


T’ = @; U —a@V 
U' = —ow3;T +a, V 
Voo= @, T —@,U 


where w,, w2 and w3 are coefficient functions. Furthermore, show that the Darboux vec- 
tor w satisfying T’ =a x T, U'’=w@x U and V’ =a x V is given by a =@,T+a.U+ 
W3 V. 


Exercise 1.3.15 ([Sco95]). A unit speed curve of constant precession is defined by the property 
that its (Frenet) Darboux vector w = t T + x B revolves about a fixed line in space with constant 
angle and constant speed. Show, by following the steps below, that a curve of constant precession 
is characterized by having 


k(s) = —a sin(bs) and t(s) = acos(bs) 


where a > 0 and b are constants andc = a? + b?. First, show that the following five properties 
are equivalent for a vector A= w + DN and a line £ parallel to A(0): (i) jw| = a; (ii) cosé = 
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Figure 1.18. Linking number orientations 


a/c, where @ is the angle between w and A; (iii) |N’| = a; (iv) cos(z/2 — 0) = b/c, where 
x /2 — sgn(b)@ is the angle between A and N; (v) |A] = c. Second, given any of the properties 
just listed, show that A is always parallel to @ (i-e., A(O)) if and only if w’ = —b N’. Finally, show 
that A’ = 0 if and only if 


t =bk and K = —bt. 


Solve this system of differential equations to get the result. Maple’s “dsolve” command may be 
useful. An explicit parametrization for curves of constant precession is given in the Maple section 
at the end of this chapter. 


Exercise 1.3.16. Let a and B be two closed curves (i.e., a(a) = a(b) and similarly for 8) which 
do not intersect. The linking number of a and B, Lk(a, B), is defined by projecting R? onto a 
plane so that at most two points of a and £8 are mapped to a single image. Then, assigning +1 to 
every a undercrossing according to the orientations shown in Figure 1.18 and summing the +1’s 
gives Lk(a, B). Let {T, U, V} be a frame as in Exercise 1.3.14. The twist of U about a is defined 
to be 


1 b 
Tw(a, U) = =| @; dat. 
Ja 


Let 6 =a+eU, where ¢€ is small enough so that a and 8 do not intersect. Show that, if a is 
a plane curve, then Lk(a, 8) = Tw(a, U). This is a special case of White's formula which has 
proved important to modern DNA research (see [Poh80, PR78] for instance). 


Exercise 1.3.17. For the Frenet frame {7, N, B}, show that the twist of the principal normal N 
about a is (up to a multiple of 1/27) simply the total torsion of a. That is, show 


1 b 
Tw(a, m= f t dt. 
Ja 


Example 1.3.18 (Helix Curvature). 
Consider the helix B(s) = (acos*, asin£, %) with c = (a? + b*)'/?. Now Bs) = (—(a/c) 
sin(s/c), (a/c) cos(s/c), b/c) with |B’(s)| =a*/c +b? /c? =c?/c2 =1, so B has unit 
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B(O) 


Figure 1.19. Plane through £(0) normal to B 


speed. Thus T = f(s) and T’ = B’(s) = (—(a/c’) cos(s/c), —(a/c”) sin(s/c), 0) = (a/c?) 
(—cos(s/c), —sin(s/c), 0) = (a/c?) N. Thus 


a 
= SS 
ce (a? +b?) 


a constant. 
Exercise 1.3.19. Show that t = b/c? = b/(a* + b*), a constant. 


Constraints on curvature and torsion produce concomitant constraints on the geometry of a curve. 
The simplest constraints are contained in the following 


Theorem 1.3.20. Let B(s) be a unit speed curve. Then, 
(1) x = 0 ifand only if B is a line; 


(2) for x > 0, t = 0 ifand only if B is a plane curve. 


Proof. (1) Suppose £ is a line. Then 6 may be parametrized by B(s) = p+ sv with |v| = 1 (so 
B has unit speed). Then T = f’(s) = v constant, so T’ = 0 and, consequently, « = |7’| = 0. 

Suppose «x = 0. Then [7’| = 0 by definition, so 7’ = 0 and, therefore, T = v is aconstant (with 
iv| = 1 since B has unit speed). But B’(s) = T = v implies B(s) = p+ sv with p a constant of 
integration. Hence, f is a line. 

(2) Suppose t = 0. Then, by the Frenet Formulas, B’ = 0, so B is a constant. But this means 
B(s) should always lie in the plane perpendicular to B. We show this. Take the plane through the 
point (0) and perpendicular to the normal vector B (see Figure 1.19). Recall that a point P is in 
this plane if and only if (P — £(0))- B =0. 

We will show that, for all s, (B(s) — B(0))- B = 0. We have ((B(s) — B(0))- BY = (B(s) — 
B(0)Y - B + (B(s) — B(0))- B’ = B’'(s)- B=T-B=0. Hence, (f(s) — B(0))- B = constant. 
To identify this constant, evaluate the expression at s = 0. We get (8(0) — £(0))- B = 0. Then 
for all s, (B(s) — B(0))- B = 0 and, hence, f(s) is in the plane determined by £(0) and the 
constant vector B. 
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Suppose £ lies in a plane. Then (as we noted above) the plane is determined by a point P and 
anormal vector n # 0. Since £ lies in the plane, 


(B(s) — P)-n =0 for all s. 


By differentiating twice, we obtain two equations: ’(s) -n = Oand B’(s)-n =0.Thatis, T-n = 
0 and « N-n =0. These equations say that n is perpendicular to both T and N. Thus n is a 
multiple of B and 

tn/|n| = B. 


Hence B is constant and B’ = 0. The Frenet Formulas then give t = 0. 0 


So now we see that curvature measures the deviation of a curve from being a line and torsion 
the deviation of a curve from being contained in a plane. We know that the standard circle of 
radius r in the xy plane has tr = 0 and « = 1/r. To see that a circle located anywhere in R? has 
these properties, we have two choices. We could give a parametrization for an arbitrary circle in 
R? or we could use the direct definition of a circle as the collection of points in a plane which 
are a fixed distance from a given one. In order to emphasize geometry over analysis (for once), 
we take the latter approach. 


Theorem 1.3.21. A curve B(s) is part of a circle if and only if x > 0 is constant and t = 0. 


Proof. Suppose f is part of a circle. By definition, B is a plane curve, so t = 0. Also by 
definition, for all s, |8(s) — p| = r. Squaring both sides gives (8(s) — p) - (B(s) — p) = r?. If we 
differentiate this expression, we get (for T = 6’) 


2T -(B(s)—p)=9 or T-(A(s)—p)=0. 
If we differentiate again, then we obtain 
T'-(B(s) —p)+T-T =0 
«N -(B(s)—p) + 1=0 (*) 
« N -(B(s)—p) = —1. 
This means, in particular, that « > 0 and N - (B(s) — p) ¥ 0. Now differentiating (*) produces 


d 
—N - (Bs) — p) +« N’ -(B(s) — p) +e N-T =0 


dx 
a (B(s) — p) + «(—KT + 7B) - (A(s)—p) +0 =0. 
Since t = 0 and T - (B(s) — p) = 0 by the earlier calculation, we have 
dk 
—_ N -(B(s) — p) =0. 
Ss 


Also, N - (B(s) — p) 4 0 by the work above, so = = 0. This means, of course, that « > 0 is 
constant. 

Suppose now that tr = 0 and « > 0 is constant. To show A(s) is part of a circle we must show 
that each B(s) is a fixed distance from a fixed point. For the standard circle, from any point on 
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the circle to the center we proceed in the normal direction a distance equal to the radius. That is, 
we gorN = 1/« N. We do the same here. Let y denote the curve 


1 
y(s) = Bis) + —N. 
K 


Since we want y to bea single point, the center of the desired circle, we must show that y’(s) = 0. 
Computing, we obtain 


1 
y(s)= fh (s)+—N’ since x is a constant 
K 


1 
=T+—(-«T + 7B) 
K 


=T-T since tr = 0 
= 0. 
Hence y(s) is a constant p. Then we have 
! 1 
156) - pi = |- =| aed 
K K 
so p is the center of a circle B(s) of radius | /x. fa) 


Exercise 1.3.22. Show that, if every tangent line of a unit speed curve 8 passes through a fixed 
point p, then f is a line. Hint: p = B(s) + r(s)§’(s) for some function r(s). 


Exercises 1.3.23. Let a@ be a unit-speed curve which lies on a sphere of center p and radius R. 
Show that, if r # 0, then a(s) — p= —4N —(4)'1B and R? = (4 + (4) 2). 


Exercises 1.3.24. Show that, if (ty % 0 and (ty + q4y+y is a constant, then a (unit speed) 
curve a lies on a sphere. 


Exercise 1.3.25. (Come back to this after the next section.) Compute the curvature « and torsion 
t of the curve 


a(t) = (a cos’(t), a sin(t)cos(t), a sin(t)). 


Then, for a = 1, graph a(t) and the sphere of radius 1 centered at the origin on the same set of 
axes. Also, show that the curve a(t) is a spherical curve by computing 


ry ely a : 

K t\x/ ja’(t)| 
to be a constant (which must be the squared radius of the sphere on which @ lies). Note that the 
extra |a’(t)| = ds /dt is necessary since a is not unit speed and the derivative of 1/« here is with 


respect to t whereas the derivative of 1/x above was with respect to arclength s. Hint: use Maple 
and formulas from the next section. 
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Exercise 1.3.26. Find a parametrization for the curve obtained by intersecting the following 
sphere and cylinder: 


eoy4? =4e?, (x ay +y? =a’. 


Hint: x = acost +a, y =asint work in the cylinder. Plug these x and y into the sphere’s 
defining equation and solve for z. This curve is known as Vivianis curve. Compute the curvature 
and torsion of Viviani’s curve and (since we know the curve is spherical!) verify that the formula 


car oy ells ees eee eee 
ee) eo) 


holds. Finally, graph Viviani’s curve on the sphere. Compare Viviani’s curve with the curve of 
Exercise 1.3.25 and make a conjecture. Can you prove your conjecture? Again, formulas from 
the next section are required or Maple may be used. 


Exercise 1.3.27. For the curve B(s) = (z cos(s), sin(s), F cos(s)), find « and t. Identify the 
curve. 


Exercise 1.3.28. Let @ be a unit-speed curve. Show that there is a unique circle 6 with 
B(O)=a0) pO)=a'(0) =p") = a"(0). 


Hint. Start with B(s) = p + R cos(%)v + R sin(z)v2 where v; and v2 are orthonormal vectors. 
Your proof will show that 8 lies in the plane spanned by T and N. This is the osculating plane 
of a and £ is called the osculating circle of a. Your proof will also show that the radius of 8 
is exactly 1/x,(0). In this sense £ is the circle of closest fit to a at a(0). Here are two things to 
think about. Does the osculating circle always lie on one side of the curve? How could you use 
the idea of the osculating circle and a computer to calculate curvature at points of a curve? 


Before we leave this section, we would like to consider the effect of curvature on arclength 
as a curve evolves. In general, in differential geometry, we are interested in the rates of change 
of geometric quantities such as arclength, area and volume. In this way, calculus impinges on 
geometry through the expression of the rate of change of shape characteristics. This approach 
comes from the calculus of variations, the subject of Chapter 7. Let’s consider a specific instance. 


Example 1.3.29 (Rate of Change of Arclength). 

Let a(s) be a unit speed closed curve with arclength Lo and suppose we push it outward in 
a normal direction. In other words, consider the evolved curve B(s) = a(s) + € 6(s) where we 
assume 6 - a’ = 0 always. We wish to know how the arclength changes initially (i.e., as © > 0) 
under such an evolution. Note that 


B'- Bl =a'-a'+2€a'- 8 +625. 5) 


so that 6’s arclength becomes a function of €: 


Lae) = f Vala +2ea FAT Has, 
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The initial rate of change of Lg(€) is given by 
-(f 2a’-6’+2€8'- 5’ ) 
o5- 2Va’-a’ +2€a'- 8 +628 - 8) |e 


fe 5 
= [tas 
lo’| 


= fe -8'ds since a is unit speed 


dL g(e) 
de 


== fa" bas since 0 = (a’- 6) =a” -5+0'-8 


=— / KN -éds since a” = x N by the Frenet Formulas. 


If we choose 5(s) = N(s) (a’s principal normal), then we obtain L'(0) = — f x ds. This example 
is generalized to curves lying on surfaces in Exercise 5.1.4. 


Exercise 1.3.30. Find the initial rate of change of the circle of radius r as it is pushed out along 
its principal normal. There is no need to use calculus here. Verify your answer by integrating the 
curvature of this circle from 0 to 27r (since we take a unit speed circle). 


1.4 Non-Unit Speed Curves 


Although all regular curves have theoretical unit speed parametrizations, some (such as the 
ellipse) do not admit explicit ones. Therefore, in order to understand the geometry of these 
curves we must obtain modifications of the Frenet Formulas which account for non-constant 
speed. 

Let a(t) be an arbitrary curve with speed v = |a’(t)| = ds/dt. Theoretically, we may 
reparametrize a to get a unit speed curve @(s(t)), so we define curvature and torsion of @ in 
terms of its arclength reparametrization a(s(t)). Moreover, the tangent vector a(t) is in the di- 
rection of the unit tangent T(s) of the reparametrization, so T(s(t)) = a’(t)/|a’(t)|. This says that 
we should define a non-unit speed curve’s invariants in terms of the invariants of its unit speed 
reparametrization. 


Definition 1.4.1. We make the following definitions: 


(1) The unit tangent of a(t) is defined to be T(t) = T (s(t)). 


(2) The curvature of a(t) is defined to be x(t) = K(s(t)). 


(3) If« > 0, then the principal normal of a(t) is defined to be N(t) zh N(s(t)). 


(4) If« > 0, then the binormal of a(t) is defined to be B(t) = B(s(t)). 


(5) Ifx > 0, then the torsion of a(t) is defined to t(t) = T(s(t)). 
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Don’t be confused by this definition. Its main function is to aid us in calculating things like 
tangent and normal vectors (and their derivatives) by using the chain rule. As an example, suppose 
we want to find T(t). By definition, T(t) = T(s(t)), where T(s(t)) is the unit tangent vector for 
the arclength reparametrization of a(t), @(s(t)). We then have 


da(s) da(t) dt _da(t)1 _ da(t) 1 
ds dt ds dt v dt |a(t)j 


T(t)=T(s)= 


This is, of course, exactly what we want. Thus, Definition 1.4.1 should just be viewed as a 
convenient way to remind the reader to use the chain rule! 


Theorem 1.4.2 (The Frenet Formulas for Non-Unit Speed Curves). For a regular curve a(t) with 
speed vy = ds/dt and curvature x > 0, 


T(t) = KUN 
Nt) = -kvT +tvB 
Bt) = —tvN 


Proof, The unit tangent T(t) is T(s) by definition. Now T’(t) denotes differentiation with respect 
to t, So we must use the chain rule on the right-hand side to determine « and tT. 


dT(s) d. 
a a a 
= K(s) N(s)v 
= k(t) N(t)v(t) by definition 
T'’ =kvN, 


so the first formula is proved. For the second and third, 


by the unit speed Frenet formulas, 
=—-KvT+TVB. 


and 

dB(s) ds 
ds at 

=—tNv 


=—-tvN. 


BO = 


Lemma 1.4.3. a! = vT anda" = = T+kv?QN. 
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Proof. Since a(t) = @(s(t)), the first calculation is, 


ds 
i t = =F eek 
ey = a5) 5 
= vT(s) 
= v(t) T(t) 
while the second is 
d 
a") = STO) + VOTH) 
dv 
ie T(t) + v(t)e(t)u(t) N(o) 
dv 2 
= a T(t) + K(t)v*(t) N(t). 
O 
Exercises 1.4.4. Work out the following precisely. The formula for a” allows an analysis of the 
following physical situation. Suppose a car is going around a level highway curve at constant 
speed (only for convenience). Then the centripetal acceleration of the car is a” = xv* N. The 
only force acting on the car is the frictional force required to keep the car on the road. As long 
as the tires roll and do not skid, this force is given by a coefficient of friction y (typically + .60) 
multiplied by the weight (i.e., mass x gravitational constant) mg of the car. By Newton’s Law, 
to keep the car on the road we require 


pumg > mkv? or ug >Kv’. 


For a road of given curvature x, to stay on the road, speed must satisfy 


What changes if the road is banked? 


From the definitions above and what we know about unit speed curves, for a non-unit speed 
curve a, we have T = a’/|a’|, B =T x Nand N = B x T. Wealso have the 


Theorem 1.4.5. For any regular curve a, the following formulas hold. 


() B= pee 
lar’ x a” | 
Jo’ x a” 
c=, 
(2) k ia’ 
QB) a (a x a”) . a” 


la’ x a"? 
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Proof. For (1), we use the formulas of Lemma 1.4.3 to get 
, " dv 2 
axa =(vT)x qe ees, 


dv 3 
SUSE ME ey TxWN 


=O+«Kv B. 


a” 


———.. For (2), we simply solve |a’ x a”| = «v? for «. For 
ja’ x a” | 


i 
a’ xa 
Hence, ja’ x a” | =«v?,so B= 


(3), we take the third derivative 


d t 
a” = (Sr + «n) 
dt 


a ry eat” Weert, 
— —v vp—. KV 
~ de dt dt edt 


Therefore, since T’ = « N and B is perpendicular to T and N, we get 


Ba” =kv’ B-N' 
=xv? B.(-kvT + vB) 


=Ktv 


since B- T = Oand B- B= 1. Nowa’ x a" = kv’ B, so 


(a’ x a”) . a” = Pave B aq!” 
= «v3(Ktv>) 
HSK Vt. 


Of course, Ja’ x a” |* = x?v®, so we have 


2 (a x a”) Ql! (a’ x a”) a!” 


K2p ~ la’ x at!” |2 


Exercise¢ 1.4.6, For the ellipse a(t) = (a cost, b sint), show that 
« = ab/(a’ sin? t + b* cos? t)>”. 


Note the case of the circle a = b. Show that a formula for the curvature of any plane curve 
a(t) = (x(t), y(t) is given by 


|x'y “u —x"y'| 


(x!? ah y? 3/2 


Ka = 
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Exercise 1.4.7. Compute 7, N, B and the curvature and torsion for the curve A(t) = 
(e' cost, e’ sint, e'). 


The curve a(t) = (cosht, sinht, t) is known as a hyperbolic helix. Recall that the hyperbolic 
trigonometric functions are defined by the formulas, 


t -t f fort ft —t 
ete : e-—e e-e 
cosht = , sinht = , tanhr = ———_. 
2 2 e+e 


We have the fundamental identity cosh? t — sinh’ t = | as well as the derivative formulas 


d d 
aon) = sinhf, 7, “sinh t) = coshf. 


Exercise 1.4.8. For the hyperbolic helix, show that 


Q) T= sq (tanh t, 1, secht) where secht = 


on 
coshr? 
ne 
(2) B= ( 
(3) N =(sechr, 0, —tanhr), 


—tanhr, 1, —sechr), 


(4) « = 1/(2 cosh? r), 
(5) t = 1/(2cosh? 1). 


Exercises 1.4.9. Compute s(t) = i ja’(u)|du for the hyperbolic helix a. 


Exercise 1.4.10. Let a(t) be a not necessarily unit speed curve with involute Z(t). Show that the 
curvature of the involute Z(t) is given by 


v Kot? + Te(t? 
S(t) Ka(t) 


kz(t) = 


where s(t) is a@’s arclength function. What does the formula become if @ is a plane curve? 
Compute the curvature of the involute of the unit circle directly and from the formula 
above. 


Example 1.4.11 (Plane Evolutes). 
The plane evolute of a plane curve a(t) is the plane curve whose involute is w. A formula for the 
evolute is given by 


E(t) =a(t)+ a N(t) 
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where N is the normal of @. Note that the evolute is the curve consisting of the centers of the 
osculating circles associated to a. To see that the involute of € is a, first compute 


1\’ 1 
gaa! + ( ) ween 
kK 


K 


Le I 
=a'+(-) N+-—(-KvT +7tvB) 
kK K 


1 ? 
“+ (<) N—-vT since tT = 0 
K 


] ? 
= (=) N sincea’ = vT. 
K 


The length of €’ is then |E’| = | (+)’ |. Note then that the unit tangent vector Ts of the evolute is 


L 
K 


the normal N of a. The involute of the evolute is then given by 


Te(t) = E(t) — s(t) Te(t) 
= E(t) — s(t) N(t) 


SGC NSoW 
K K 
= a(t). 


Here we have used the fact that the arclength function for the evolute is s(t) = a This fol- 
lows since s(t) = fy |E|du = fy | (4)’ |du = +. We note here that evolutes may be defined 
for 3-space curves also. The formula for a space evolute is € = a + 4 N + £ B where c isa 
constant. 


Example 1.4.12 (The Plane Evolute of a Parabola). 
Suppose a parabola y? = 2ax is given with parametrization a(t) = (t?/(2a), t). Then a(t) = 
(t/a, 1), v = |o’| = Va* +12/a and, therefore, T = (t//a? + t2,a/Va? + t2). From this we 


may calculate curvature, 


T= a —at 
= (a2 + t2)3/2’ (a2 + 12)3/2 


=KvN 
_ vat +t? a 
7 a 
and taking lengths gives 
a Va? +t? 
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0.4 


0.2 


Figure 1.20. Neil’s parabola 


Hence, « = a?/(a? + t*)'/? and, putting this in the expression for T’, N = (a/J/a? +4 ??, 
—t//a? + t*). Finally, the evolute is then given by 


1 
E(t) = a(t) + wa N(t) 


ay fx ? 4 (a2 + ry/ a =i 

~ 2a’ a2 Va+t. az + 72 
Oe Cogs eee 5 

~ 2a at J 


Exercise 1.4.13. Show that the x and y coordinates of E(t) satisfy the relation 
Tay? =8(x —ay. 


This curve is known as Neils parabola (see Figure 1.20). 
Exercise 1.4.14. Show that the evolute of the ellipse a(t) = (a cost, b sint) is the astroid 


a —b? 3 b—a? . 3 
cos” f, sin t, }. 
a b 


Exercise 1.4.15. Show that an evolute of the catenary a(t) = (t, cosht) is given by (¢ — cosht 
sinht, 2 coshtr). 


Exercise 1.4.16. Show that an evolute of the astroid a(t) =(a cos’f,a sin’t) is an ex- 
panded, rotated out-of-phase astroid. Hints: After computing the evolute, take an astroid 
(2a cos? t, 2a sin’ f), rotate it by an amount guessed from the graph and let r = f — 1/4. 
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1.5 Some Implications of Curvature and Torsion 


We have already seen some of the ways in which the Frenet Formulas detect geometric in- 
formation about the curve. As a more complicated example we will now consider a class 
of curves called cylindrical helices. In order to understand the definition of these curves, let 
us consider the standard helix, a(s) = (acos£, asin£, %) where c = (a? + b?)'/?. Consider 
T =a'(s) =(—“sin=, $cosé, 8), Notice that the third coordinate is constant, so T -e; = ° 
(where e; = (0,0, 1) as usual). Hence, cos@ = 2, so the angle 8 between T and e3 never 
changes. 

We may generalize this property by saying that a curve a, which we assume has unit speed, is 
a cylindrical helix if there is some constant unit vector u such that T - u = cos @ is constant along 
the curve. In fact, 8 may be specified by saying that it is the angle in the T-B-plane through which 
T must be rotated to reach u, where the rotation begins toward B through the right angle between 
T and B. The direction of the vector u is the axis of the helix. We can identify cylindrical helices 
by a simple condition on torsion and curvature. 


Theorem 1.5.1. Suppose a has « > 0. Then a is a cylindrical helix if and only if the ratio = is 
constant. 


Proof. Without loss of generality, assume a has unit speed. Suppose T - u = cos 6@ is constant 
for some unit vector u. Then 0 = (T -u)’ = T’-u=kN-u,soN-u=Osincex > 0. Now, u 
is perpendicular to N, so 


u=(u-7)T+(u-B)B 
=cosé7T +siné@ B. 


The diagram in Figure 1.21 explains the coefficient of B. Because u is constant, u’ = 0. Hence, 


0=cosdT’ +sin@ B’ 
=xcosdN —tsinOdN 


= (xk cos@ — tsin@) N. 


Thus, « cos@ — t sin@ = 0, which gives cot@ = t/«. Therefore, t/« is constant since @ is. 


Figure 1.21. 
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B(s) 


B(0) 
Figure 1.22. 


Now suppose t/x is constant. The cotangent function takes on all values, so we can choose 0 
with cot = t/x. Define u = cos 6 T + sin@ B to get 


u’ =(kcos6 —tsind)N = 0. 


Hence, u is a constant vector and clearly T -u =cos@ is constant. Thus a@ is a cylindrical 
helix. Oo 


In order to get a good geometric picture of a cylindrical helix we might want to project the helix 
back onto the plane determined by a starting point (8(0)) and the fixed (normal) vector u. The 
picture is given in Figure 1.22. 

To do this, we subtract the u-component of B(s) — B(0) from B(s). Now, this u-component 
has length 


h(s) = (B(s) — B(0))- u 
and to see what this is in terms of the definition of 8 as a cylindrical helix, we differentiate, 


dh ’ 
— =f -u=T-u=cosé, 
ds 


obtaining a constant. Hence, integrating the derivative above gives A(s) = scos@ +C. Now 
h(0) = 0 =0+C,soC = Oalso. Finally, A(s) = s cos@. Therefore the projection of the cylin- 
drical helix B(s) onto the plane through £(0) and perpendicular to u is given by: 


y(s) = B(s) — AG)u, 


where A(s) = s cos@. Note that this parametrization of y is not necessarily a unit speed parame- 
trization. 


Exercise* 1.5.2. Show that x, = kg/sin* 6, where xg is the curvature of . 


Exercise 1.5.3. A cylindrical helix £ is said to be a circular helix if the projection y 1s a circle. 
Show that £ is a circular helix if and only if t and « are constant. 
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Exercises 1.5.4, What can you now say about the curve considered in Exercise 1.3.11, 


_ (i+sy? (i -—sp? Ss 
y= (“SS Ss). 


Exercises 1.5.5. What do you know about the curve A(t) = (t + V3 sin(t), 2 cos(t), V3 t — 
sin(t))? 


Exercise 1.5.6. Show that A(t) = (at, bt*, t°) is a cylindrical helix if and only if 4b4 = 9a?. 
In this case, find the vector u such that T = B’/|f’| has T - u = constant. 


Exercises 1.5.7. Let B(s) be a curve and suppose that, for every s, N(s) is perpendicular to a 
fixed vector u. Show that £ is a cylindrical helix. Conversely, show that, for any helix, the normal 
N is perpendicular to the axis u of the helix. 


Exercise 1.5.8. Show that, for any helix, the Darboux vector @ is parallel to the axis u of the 
helix. (See Exercise 1.3.12.) 


Exercise 1.5.9. Define a curve A(s) by B(s) = de B,(t)dt, where a(t) is a unit speed curve with 
T, > 0. Show that kg = T., Tg = Ka, Tg = By, Ng = —Nq and Bg = T,. Now suppose @ is a 
circular helix and show that £ is as well. 


Exercise 1.5.10. Show that the space evolute €(s) of a plane curve A(s) is a cylindrical helix. 
Hint: Take €(s) to be unit speed and note, by definition, that the involute of € is 8. Compute 
torsion of 6 and set it to zero since B is a plane curve. This gives conditions on derivatives of €. 
Use the Frenet formulas applied to €. 


Theorem 1.5.1 shows us how placing a restriction on a quantity such as t/x can determine a 
type of curve. We can see this again if we go one step further than Theorem 1.5.1 and consider 
the requirement that t/x be a linear function of the arclength parameter s. In order to put this 
in context, recall that by Theorem 1.3.20, t = 0 exactly when a curve is contained in a plane. 
Another way to say this is that a unit speed curve a(s) is in the plane spanned by T(s) and N(s) 
precisely when t(s) = 0. The plane (7(s), N(s)) has a name; it is called the osculating plane 
(see Exercise 1.3.28). On the other hand, by Exercise 1.3.23, we see that if a unit speed curve 
a(s) is a spherical curve, then, for some fixed point p, we have a(s) — p contained in the plane 
(N(s), B(s)). Of course there is one more combination of Frenet frame vectors: {7 (s), B(s)}. 
We say that a unit speed curve a(s) is rectifying if, for some fixed point p, a(s) — p is always 
contained in the plane (7(s), B(s)). We then have the following characterization. 


Theorem 1.5.11 ((Che03]). A unit speed curve a(s) with k > 0 is rectifying if and only if 
ae as+b 
K 


for constants a and b witha # 0. 
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Proof. Suppose a(s) is rectifying. Therefore, we can write a(s) = A(s) T(s) + u(s) B(s) for some 
functions A and yz. We then compute as follows. 
a(s)=NTHAT’ +p B+uB’ 
=NT+AKN+p'B-ptN 
T=NT+(Axk—pt)N+y’'B 


Now, T, N and B form a basis (for each s), so we must have 
N=1 Ac — pt =0 py’ =0. 


Integrating, we obtain 
T 
Ms)=stk -=-— w=c. 
K 


These equalities give t/k =as+b, wherea =1/c,b=k/c. 
Now suppose that we are given t/k =as+b=(s+k)/c,wherec = 1/a,k = b/a. Consider 
the derivative: 


< (als) = (8 +4) T(5) —€ Bs) = T ~T = (84 Te B' 


=-(st+k)«N+ctN 
=(-(s+k)k +ct)N 
=0 


since ct = (s + k)x by assumption. Therefore, a(s) — (s + k) T(s) — c B(s) is a constant p and 
rearranging gives a(s) — p =(s +k)T(s)+c B(s), thereby showing that a(s) — p is contained 
in the plane (T(s), B(s)). O 


Remark 1.5.12. In fact, in [Che03], it is shown that a(s) is rectifying if and only if it can be 
re-parametrized to have the form a(t) = a sec(t) B(t), where A(t) is a unit speed curve on the 
unit sphere S* and a > 0. 


We have seen that curvature and torsion, individually and in combination, tell us a great deal 
about the geometry of a curve. In fact, in a very real sense, they tell us everything. Precisely, 
if two unit speed curves have the same curvature and torsion functions, then there is a rigid 
motion of R? taking one curve onto the other. Furthermore, given specified curvature and torsion 
functions, there is a curve which realizes them as its own curvature and torsion. These results 
are, essentially, theorems about existence and uniqueness of solutions of systems of differential 
equations. The Frenet Formulas provide the system and the unique solution provides the curve. 
We shall not prove these statements, but instead indicate portions of the proofs by the following 
exercises, 


Exercise 1.5.13. Suppose we want a plane curve f(s) with curvature a specified curvature 
function «(s). Show that such a 8 may be defined by 


B(s) = Gs cos O(u) du, [ sin au) du) 
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where 6(u) = i x(t) dt. Hints: use the Fundamental Theorem of Calculus to show £ has unit 
speed and then compute T;. 


Exercise 1.5.14, Find a unit speed plane curve a(s) with kg(s) = 2s and a(0) = (0, 0). The 
resulting curve is called Euler's spiral or the spiral of Cornu. Use the Maple procedure “recreate” 
to graph the curve. We will see in Example 7.6.2 that this curve arises as the solution to a 
variational problem. 


Exercise 1.5.15. Find a unit speed plane curve a(s) with xy(s) = 1/(1 +57) and a(0) = 
(0, 0). 


Exercise 1.5.16. Find a unit speed plane curve a(s) with «,(s) = 1/s and a(1) = (3, —2). 


Exercise 1.5.17. (The Fundamental Theorem of Space Curves) Show that, if two unit speed 
curves a(s) and B(s) have ky(s) = «g(s) and tg(s) = tg(s) for all s, then there is a rigid motion 
(i.e., combination of translations, rotations and reflections) of 3-space which takes a onto f. This 
characterization of space curves by curvature and torsion is one of the foundational results of 
differential geometry. Hints: 


(1) Move (0) to (0) by a translation. 


(2) Rotate and, if necessary, reflect the Frenet frame of 6 at 6(0) = a(0) into the Frenet frame 
of a. 


(3) To show that the Frenet frames now coincide for all s, consider the function 
D(s) = Ta(s) + Tg(s) + Na(s)- Na(s) + Bals)- Ba(s) 


which attempts to measure the deviation of the frames from being identical. Show that 
D(0) = 3, D(s) < 3 for all s and D(so) = 3 only when the frames coincide at so. 


(4) Show that D is constant by calculating D’ = 0. Use the Frenet equations. Hence, D(s) = 3 
for all s. 


(5) Then, a’(s) = f’(s). Integrate and use initial conditions to show a(s) = B(s), hence show- 
ing that the original translation, rotation and reflection carried B into a. 


1.6 Green’s Theorem and the lsoperimetric Inequality 


Closed curves hold a special place in the worlds of geometry and applications of mathematics. 
A curve a: [a,b] > R? is closed if a(a) = a(b) and a(a) = a”(b) for all n > 0, where 
a") denotes the n-th derivative. In applications, closed curves often represent periodic orbits 
of physical systems. This periodicity then allows a complete understanding of the dynamical 
behavior of a system by simply analyzing what happens in a single period. In geometry, there is 
often an important connection between the closed curve and the region inside that it bounds. We 
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will see this later, for instance, in the Gauss-Bonnet theorem. Here we want to present the most 
basic connection between a plane closed curve and the region inside; this connection is called the 
isoperimetric inequality. This inequality expresses a relation between the arclength of a simple 
closed curve in the plane and the area of the enclosed region. (Recall that a curve is simple if it 
has no self-intersections.) 


Theorem 1.6.1 (The Isoperimetric Inequality). Among all simple closed curves in the plane 
having a fixed length, the circle bounds the enclosed region of largest area. In particular, if C is 
a simple closed curve with length L enclosing an area A, then 


L?>4nA 


with equality holding only in the case of a circle. 


The proof we give of the isoperimetric inequality is due to E. Schmidt, but we shall basically 
follow the approach of [Che67]. Also, compare the proof below with the calculus of variations 
approach in Example 7.6.9. Before we begin the proof, we need to recall a fundamental result of 
two-variable calculus. 


Theorem 1.6.2 (Green’s Theorem). Let P(x, y) and Q(x, y) be two real-valued (smooth) func- 


tions of two variables x and y defined on a simply connected region (i.e., a region without holes) 
R of the plane. Then 


ff; oe te axdy = [ Pay-oas 


where the right-hand side is the (counterclockwise) line integral around the boundary C of the 
region R. 


Remark 1.6.3. Note that Green’s theorem may be written in different forms. For instance, an 
equivalent form is given by 


If. SG kay = | Part ay, 


In the rest of the book, we shall use whatever form of Green’s theorem suits a given 
problem. 


Example 1.6.4 (Green’s Theorem and Area). 
Let C be a closed curve enclosing a region R. Take P = x/2 and Q = y/2. Green’s theorem 


then gives 
1 
Area(R) = If. dzdy= ff 5 += sdxdy => f xdy—ydx. 
c 
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Figure 1.23. Closed curve and comparison circle 


Therefore, the area inside a closed curve may be expressed as a line integral. Of course, the line 
integral is evaluated by parametrizing the curve as B(t) = (x(t), y(t)) for a <t < b and taking 
5 foxdy-ydx=4 ff (x(t) y’(t) — y(t) x'(t)) dt. The same arguments say that there are also 
the following identifications: Area(R) = f.xdy = f.—ydx. 


Proof of Theorem 1.6.1. Let the curve C be parametrized by a(s) = (x(s), y(s)) forO <5 < L 
and assume, without loss of generality, that the parametrization is unit speed. Since C is 
a closed curve, we can place it between two parallel lines. We orient the picture so that 
these lines, m and n, are vertical and touch C at points P = a(0) and Q = a(sp) (see Fig- 
ure 1.23). We also place a comparison circle K inside the lines with radius r half the 
distance between the lines. We parametrize the circle by using the x-coordinate of a and 
making the y-coordinate obey the circle relation; B(s) = (8;(s), Bo(s)), where B\(s) = x(s) 
and 


—VJr?—x(sP O<s <5 


+ /r2?—x(s? so<s<L 


Ba(s) = 


Note that (0) = B(L). For C, we can apply Green’s theorem to compute the area inside to be 
Ac =foxdy= tn: x(s) y(s) ds. For the circle, we have 


L 
mr=Az = j — Bo(s)dx -{ —Bo(s) x'(s) ds. 
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Add these to get 


L 
Ac+xr? = xy’ — Box’ ds 
0 
L 
s [by - bax'ias 
0 
L 
= | Vex —Bx? as 
0 
L 
= i x2y/? — Ixy! Box’ + By?x” ds 
0 
L 
< ft + eryx?+y%ds since xx’ + pay'F 2 0 
0 
L 
= / x? + By” ds since a has unit speed 
0 


L 
= [ words 
0 
L 
= i rds since B(s) = (x(s), Vr? — x(s)?) 
0 
=rL. 


The usual arithmetic-geometric-mean inequality implies that / Ac m r2 < (Ac + r’)/2, so we 
obtain 


An Ac < L? 


and this is the isoperimetric inequality. Of course, we must still consider the case of equality, so 
suppose that 42 Ac = L*. Then, by following backwards through the estimates above, we see 
that all of the inequalities are, in fact, equalities. In particular, the equality 


Actarr? rb 
VA 2— — = 
cur 2 2 


says that Ac = 2 r* and, consequently, L = 27 r. Furthermore, because the integrals above are 
equal and the respective integrands positive, we must also have 


((x, Bo): (y, —x)P = (xy! — fox’? =r? 


where - denotes the dot product. But [(x, 62)| =r, |(y’, —x’)| = 1 and (x, Bo): (y’, -x) = 
|(x, B2)| \(y’, —x’)| cos(@), so we must have cos(@) = 1. Hence, @ = 0 and (x, B2) and (y’, —x’) 
are in the same or opposite directions. The lengths of the vectors then imply that (x, B.) = 
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tr (y’, —x’). Therefore, x = tr y’ and B) = =r x’. Using B. = Vr? — x2, the second equation 
gives 


using x = r sin(0); 


x(s) =r sin (= +d). 


Now, x'(s) = cos(s/r + d) and x? 4 y? = |, so we can solve to get y(s) = =r cos(s/r +d). 
Thus, up to signs, the parametrization for C is that of a circle of radius r; 


a(s) = (r sin (= +d), r cos (= +d)). 


1.7. The Geometry of Curves and Maple 


Computer algebra systems provide powerful tools for computation and visualization in differential 
geometry. Applications range from simple computations such as numerical evaluation of an 
arclength integral to more complicated graphics such as drawing geodesics on surfaces. No 
matter the application, the interaction of human geometer with computer is a very personal thing. 
Therefore, in this section and all similar ones dealing with computers, I shall give (sometimes 
in the first person) my personal version of computers and geometry. I make no claims that the 
ways I have of doing things are optimal. In fact, I’m certain that many readers will have much 
better ways of approaching various problems. Nevertheless, I believe it is worthwhile nowadays 
to expose any prospective geometer to the world of computer algebra systems. In this text, I have 
chosen to use Maple 10 (hereafter known as Maple) because this is the system that I believe 
is the simplest for students to learn. There are now many introductions to Maple on the market 
and these should be consulted for comprehensive lists of commands and package descriptions 
(see [Red93] for example). For a Mathematica approach to geometry, the reader is referred 
to [Gra93]. 

Before we can really do things in Maple, we need some basic packages to be entered. The 
following commands do this. 


> with(plots) :with(LinearAlgebra) : 


Now we have certain special plotting abilities as well as tools of linear algebra for calculation. 
Much of what we do in geometry deals with vectors, of course, and the easiest way to enter them 
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in Maple is to write the following input. (Putting vertical lines between entries gives a row vector 
while putting commas between entries gives a column vector.) 


> A:=<alblc>; B:=<dlelf>; 
A= [a, b, c] 
B:= [d, é, fl 


The linear algebra operations that we will find most useful are the dot and cross products. 
While the cross product is obvious, Maple makes the ordinary dot product more obscure. The 
DotProduct command refers to a Hermitian inner product, so it is necessary to say “conjugate = 
false” to get the usual dot product in R’. 


> DotProduct (A,B,conjugate=false) ; 
da+eb+fe 


> CrossProduct (A,B); 
[b f —ce, cd—af,ae—ba] 


Maple is not only a command-line package, but a programming language as well. Programs in 
Maple are called procedures and, throughout this book, we will attempt to give procedures for 
the standard calculations arising in differential geometry. For example, the following procedures 
compute curvature and torsion of curves from the non-unit speed formulas of Section 1.4. These 
procedures assume that the curve is given parametrically in the variable ¢ in three coordinates. If 
the curve is a plane curve in the xy-plane, say, then we make the third coordinate zero. Note that, 
in the line right after “proc( )”, the word “local” appears. This tells Maple that the variables 
following “local” are to be considered only within the procedure (usually just for intermediate 
calculational purposes), so that later on in the worksheet we can make use of the local variables 
without them having previously defined values. Also, pay attention to the “Norm” command. 
The standard Maple norm is a complex oo-norm. To get our usual norm, we need to specify 
“Euclidean, conjugate=false”’. 


> curv:=proc(alpha) 

local alphap,alphapp; 

alphap:=map(diff,alpha,t) ; 

alphapp:=map(diff,alphap,t) ; 

simplify (Norm(CrossProduct (alphap,alphapp) ,Euclidean, 
conjugate=false) /Norm(alphap, Euclidean, conjugate=false) “3, 
symbolic) ; 

end: 

> tor:= proc(alpha) 

local alphap,alphapp,alphappp; 

alphap:=map(diff,alpha,t); 

alphapp:=map(diff,alphap,t); 
alphappp:=map(diff,alphapp,t); 

simplify (DotProduct (CrossProduct (alphap,alphapp) ,alphappp, 
conjugate=false) /Norm(CrossProduct (alphap, alphapp) , 
Euclidean, conjugate=false)“2,symbolic) ; 

end: 
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In both procedures above, Maple uses the command map(diff,alpha,t) to, in one step, differentiate 
each coordinate function of the parametrized curve “alpha”. The “map” command is used in many 
situations to apply an operation to a list of coordinates. Now, finally, here’s an example. Let’s 
take a helix. 


> hel:=<a*cos(t) |a*sin(t) |[b*t>; 
hel := [acos(t), asin(t), bf] 


> curv(hel); 
a 


b? +a? 
> tor(hel); 
b 
b? +a? 
In order to evaluate the curvature and torsion of a particular helix, it is not necessary to redefine 


the whole curve. The command “subs” allows substitutions to be made into a general definition 
as in the following. 


> heli:=subs({a=4,b=2},hel) ; 
hell := [4cos(t), 4sin(t), 21] 


We can plot the helix in R? using the command “spacecurve”. See Figure 1.24.Note that the 
option “orientation” orients the plot according to spherical coordinates 6 and @. 


> spacecurve(convert(hel1l,list) ,t=0..5*Pi, scaling= 
constrained, color=black,axes=framed, orientation=[40,77], 
tickmarks=(3,3,3]); 


30 


20 


Figure 1.24. A helix 
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0 44 0 


Figure 1.25. A helical tube 


Sometimes it is convenient to thicken up the plot by making a tube around the actual spacecurve. 
The command “‘tubeplot” does this as follows. See Figure 1.25. The options “shading” and 
“lightmodel” allow us to color a 3d-plot and to shine light on it using pre-designed schemes (e.g., 
“XY” and “light3” respectively). The “style” option allows us to obtain wireframe, hidden line 
and patch plots, for example. Here “patchnogrid” gives a plot with a solid patch, but no grid lines 
on the surface. The default style is “patch”. 


> tubeplot (convert (heli,list) ,t=0..5*Pi, radius=1.2,scaling 
=constrained, shading=XY, lightmodel=light3, style= 
patchnogrid,axes=framed,orientation=[40,77]); 


Here, hell is plotted as t varies from zero to 5x. The entries following the domain of t are 
plotting options which make the plot appear a certain way. For example, “scaling=constrained” 
makes distance along the axes scaled the same in each direction. 


Exercise 1.7.1. Plot the helix with options “scaling=unconstrained” , “thickness=1” and “‘color= 
blue”. For other options, type ?plot3d[options]. The command “spacecurve” is a curve version 
of Maple’s “plot3d” command which we shall use extensively when we plot surfaces. 


To plot plane curves, we can use the “plot” command. For example, let’s take the witch of 
Agnesi. There will always be some amount of trouble going from spacecurves to plane curves. 
For example, our curvature procedure is based on a curve having three coordinates. In order to 
use the procedure when we have a plane curve, we must add an artificial third coordinate equal to 
zero. Also, plotting a plane curve with Maple’s “plot” command requires exactly two coordinates. 
We can get around this trouble also by defining the witch with a fake third coordinate as follows. 


> witch:=<2*tan(t) |2*cos(t)72/0>; 
witch := [2 tan(t), 2cos(t)’, 0] 
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-6 -4 —2 0 2 4 6 
Figure 1.26. The witch of Agnesi 
> curv(witch) ; 


7 (4cos(t)? — 3) cos(t)* 
(—4 cos(t)® — 1 + 4cos(t)8) /4 cos(r)® + 1 — 4 cos(r)8 


However, you cannot say “plot(witch, ...)” as we did for “spacecurve” for two reasons. First, 
since we’ve used three coordinates, Maple’s “plot” command will not work. Second, even if 
we use two coordinates to define the witch, Maple’s “plot” command uses an annoying syntax 
which will not allow an input such as “plot(witch,t=-1.3..1.3);”. You must say (in this exact 


form) 


> plot([2*tan(t) ,2*cos(t)°2,t=-1.3..1.3], scaling=constrained, 
axes=framed,color=red) ; 


It is also possible to write the following to produce Figure 1.26. 


> plot([witch[1] ,witch[2] ,t=-1.3..1.3], scaling=constrained, 
axes=framed,color=red) ; 


Notice that, unlike “spacecurve”, the square brackets must go around the t = ... as well as the 
parametrization. 


Exercise 1.7.2. Plot the Jemniscate r? = a* cos(2@) using the parametrization (after fixing a to 
be some value) 


a(t) = (a(cost)./cos(2r), a(sint) /cos(2r)). 


Now look in Maple’s Help to see how to plot in polar coordinates. 


We can use Maple procedures to graph more complicated curves. For example, the second 
procedure below takes input a curve and gives output the involute of the curve. The first procedures 
gives the arclength of a curve from 0 to t. 


> arce:=proc(alpha) 

local alphap; 

alphap:=map (diff ,subs(t=u,alpha) ,u); 

simplify (int (Norm(alphap, Euclidean, conjugate=false) ,u=0..t), 
symbolic) ; 

end: 


1.7. The Geometry of Curves and Maple 47 


Notice that this procedure expects the curve “alpha” to be parametrized by “t” and that, at the 
very first stage, “t” is replaced by a dummy variable “u”. Finally, the length of the tangent vector 
in terms of “u” (i.e., the speed) is integrated to produce an arclength in “t”. Here is an example. 
First define the unit circle as a parametrized curve and then apply “arc”. 


> circ:=<cos(t) |sin(t)[0>; 


circ := [cos(t), sin(t), 0] 


> arc(circ); 


This is a verification that the unit circle above is parametrized by arclength. Using this arclength 
procedure, we can write a simple involute procedure. 


> involute:=proc (alpha) 

local a; 

a:=1/Norm(map (diff ,alpha,t) ,Euclidean, conjugate=false) * 
map (diff ,alpha,t); 

simplify (alpha-arc (alpha) *a, symbolic); 

end: 


For example, we have the following input and output. 


> involute(circ); 


[cos(t) + ¢ sin(t), sin(t) — t cos(t), 0] 


If you wish to graph the circle and its involute together, you can do this by creating a plot structure 
for each and then displaying both at once. See Figure 1.27. 


> cireplot:=spacecurve(convert(circ,list), t=0..2*Pi, 
color=black): 


> invplot:=spacecurve(convert (involute(circ), list) ,t=0.. 
2*Pi,color=black): 

> display({circplot,invplot },scaling=constrained,axes= 
framed, orientation=[-89,0]); 


Exercise 1.7.3. Show that the involute of the helix (cost, sint,t) is a plane curve. What plane 
curve is it? 


Similarly, we can write a procedure to find the plane evolute of a given curve. Recall 
that, given a(t), its evolute is defined by €(t) = a(t) + 1/«(t) N(t) where N is the normal 
ofa. 
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Exercise 1.7.4. Show that 
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4 3 2 1 0 1 


Figure 1.27. Involute of a circle 


Ler eared 


i i a 
: Rane Xa )xX a’. 


Hint: use the non-unit speed Frenet formulas. 


The exercise leads to the following procedure. 


> evolute:=proc (alpha) 

local alphap,alphapp,A,C; 

alphap:=map (diff,alpha,t) ; 
alphapp:=map(diff,alphap,t) ; 
A:=Norm(alphap, Euclidean, conjugate=false)~2/ 
Norm(CrossProduct (alphap,alphapp) ,Euclidean, 
conjugate=false)~2; 
C:=CrossProduct (CrossProduct (alphap,alphapp) ,alphap) ; 
simplify (alpha+A*C, symbolic) ; 


end: 


The following input and output gives the evolute of an ellipse. 


> ellipse:=<a*cos(t) [b*sin(t) |0>; 


ellipse := [acos(t), bsin(t), 0] 


> evolute(ellipse) ; 
— (—b? +a?) (—1+ cos(t)*) sin(t) (—b* + a*) | 
a : b : 


Notice that Maple simplifies the first coordinate, but not the second. In fact, although there 
does not seem to be a way of inducing Maple to simplify both coordinates simultaneously, the 
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second coordinate simplifies by hand to 
sin(t)?(b* — a’) 
, 
So the evolute of an ellipse is an astroid. 


Exercise 1.7.5. Show that the evolute of the catenary (¢,cosh(t)) is given by (t — cosh(t) 
sinh(t), 2 cosh(t)). Plot this evolute. 


Exercise 1.7.6. Do Exercise 1.4.16 by finding the evolute of the astroid (cos*(f), sin*(t)) 
using 
> astr:=<cos(t)73,sin(t)73,0>; 
cos(t)? 
astr := sin(t)> 


0 


and then following the instructions below. First take 


> evastr:=evolute(astr) ; 
—cos(t) (2 cos(t)? — 3) 
evastr:= | — sin(t) (2 cos(t)* + 1) 
0 


Take the astroid (2a cos3(t), 2a sin3(t)) (thought of as a column vector), multiply it on the left by 
the rotation matrix 


cos(z/4) —sin(2/4) 
sin(z /4) cos(1/4) 


and shift the phase of the rotated astroid by 7/4 as follows. 


> R:=Matrix([[cos(Pi/4) ,-sin(Pi/4)], [sin(Pi/4) ,cos(Pi/4)]]); 


V2 V2 
oe eo ee 
iltagee, 72 
a. POR 


> astr2:=subs(t=u-Pi/4,R.<2*astr[i], 2*astr[2]>); 


V2 sin(u + a + /2 cos(u + mi 
astr2 := 


J2sin(u + ui — /2cos(u + =) 
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Figure 1.28. Astroid and its evolute 


> factor(simplify(expand(astr2[1]),trig)); 
factor (simplify (expand(astr2[2]) ,trig)); 


—cos(u) (2 cos(u)” — 3) 


sin(u) (2 cos(u)* + 1) 


Finally, compare your result to the expression for the evolute of the astroid above. Now plot both 
astroids on the same set of axes using the “display” command. See Figure 1.28. 


> plot({[astr[1] ,astr[2] ,t=0..2*Pi], [evastr[1],evastr[2], 
t=0..2*Pi]],scaling=constrained,color=black) ; 


Now, we come to the curve which seems to have prompted the invention of involutes and 
evolutes. In the 1600s, explorations of the world began in earnest by the seafaring nations of 
Europe. Of course, the problem of navigation arose and, while it was easy to fix latitude at sea 
(e.g., using Polaris), it was next to impossible to fix longitude. This caused many famous disasters. 
The simplest plan to find longitude was this: first, on each day of the year, note what time the sun 
was at its highest point (always called noon) in Greenwich, England and compile this information 
(to be taken along on ships); next carry a clock along on board ship set to Greenwich time; at 
sea, note the time that the sun is at its zenith (i-e., local noon) using this clock and compare to the 
compiled information. For example, if, on a certain day, the sun is at its highest point at 12:00 
pm in Greenwich, but at its highest point at sea at 1:00 pm on the Greenwich clock, then the 
ship would be at 15 degrees west longitude from the prime meridian through Greenwich. (Why 
does one hour of time equal 15 degrees here?) The problem with the plan was that, at the time, 
clocks worked by pendula and, while we often say that, for small oscillations, a pendulum’s 
period does not depend on the amplitude, this is no longer true for larger oscillations — the very 
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Figure 1.29. A cycloid 


type of swings which might be expected at sea! So, how could the period of a pendulum truly be 
independent of the amplitude? 

Christian Huygens found an answer in the 1670’s (or so). Huygens knew that a cycloid was 
a tautochrone (see Exercise 1.1.14), a curve with the property that a particle sliding down it 
(without friction) takes the same time to reach the bottom, no matter where it starts on the curve. 
So, if a pendulum bob could be induced to freely follow the path of a cycloid, its period would 
be independent of its amplitude. But how could the bob be induced to follow a cycloidal path? 
Before we say, let’s look at the involute and evolute of a particular cycloid. See Figure 1.29 for 
the cycloid. 


> cycloid:=<t+sin(t) |3-cos(t)>; 
cycloid := [t + sin(t), 3 — cos(t)] 
> plot ([t+sin(t), 3-cos(t) ,t=0..2*Pi]); 
> cycplot:=plot([t+sin(t), 3-cos(t), t=0..2#*Pi]): 


Now imagine unwinding a thread from the cycloid above with fixed end at the cusp on top. 
The path followed by the end of the thread is, of course, the involute of the cycloid. 


> cycinv:=involute(cycloid) ; 
cycinv := 


—t ,/2cos(t) + 2 — sin(t) 2 cos(t) + 2+ 4 sin (5) + 4sin G) cos(t) 
V2cos(t) + 2 


~3 /2eos(t) + 2 + cos(t) /2cos(f) +2 + 4sin (5) sin(t) 
4/2 cos(t) + 2 
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Figure 1.30. Cycloid and involute 


The reader may check by hand (since Maple simplification does not work well here) that the 
expressions above give another cycloid! 


> invcyc:=<t-sin(t) |1+cos(t) |0>; 


inveyc := [t — sin(t), 1 + cos(t), 0] 
Let’s plot the involute together with the original cycloid (see Figure 1.30). 


> invcycplot:=plot([t-sin(t) ,1+cos(t), t=0..2*Pi] ,color 
=blue): 


> display({cycplot,invcycplot}, scaling=constrained) ; 


Figure 1.30 shows that the pendulum bob will follow a cycloidal path if the pendulum itself is 
flexible and it peels off barriers in the shape of cycloids. To design the barriers, first choose a 
path (i.e., a particular cycloid) and, since it is to be the involute of the barrier cycloid, take its 
evolute. For instance, if we take the evolute of the path above, we get 


> evolute(invcyc); 


[t + sin(t), 3 — cos(t), 0] 


the original barrier cycloid. Unfortunately, Huygens’s plan didn’t solve the problem of longitude. 
For the full story, see [Sob95]. 

Now let’s use Exercise 1.5.13 to try to recreate a plane curve from its curvature alone. 
While the formula involves integrals which can only rarely be solved, we can still plot the 
resulting curve. The exercise shows that a plane curve f is determined by its curvature as 


follows: 
B(s) = (f cos @(u) du, - sind(u)al 
0 0 
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where 6(u) = ae «(t)dt. By the Fundamental Theorem of Calculus, we can transform these 
integrals into a system of differential equations to be solved numerically and plotted. The system 
is 

dé d d 

i K(s) ai = cos(8(s)) . = sin(O(s)). 

The result will be the unit speed curve with the specified curvature. As a first example, let’s 

take x = 5 itself. In Maple, we define the system and then use Maple’s differential equations 
solver to get a numeric solution. For details on its operation, type “?dsolve”. 


> beta(s)=[Int(cos(theta(u)) ,u=0..s), Int(sin(theta(u)), 
u=0..s)]; 


Ss 


B(s) = [[ costwyau, / sin(@(u)) du] 
0 0 
> theta(u)=Int (kappa(t) ,t=0..u); 
O(u) = [awa 
0 


> sys:=diff(theta(s),s)=s, diff(x(s),s)= cos(theta(s)), 
diff (y(s),s)=sin(theta(s)) ; 
d d d ; 
SYS = ra A(s)=s, is x(s) = cos(@(s)), as y(s) = sin(@(s)) 


> p:=dsolve({sys,theta(0)=0,x(0)=0, y(0)=0},{theta(s), 
x(s),y(s)},type=numeric): 


Notice that the numeric option in “dsolve” requires initial conditions as well as the system of 
equations. In fact, for certain equations, “dsolve” will return a solution in closed form from the 
system alone. The numeric solution may now be plotted using Maple’s “odeplot” command. See 
Figure 1.31. Note that “odeplot” takes as input the solution given by “dsolve” (here given the 
name “p” by using the “define symbol” : =) and the variables to be plotted. We could plot 
“{s,x(s)]” for example. Here we want the phase space description. 


> scurv:=odeplot (p, [x(s),y(s)],-5..5, numpoints=200) : 
> display(scurv,view=[-2..2,-2..2]); 


Exercise 1.7.7. Create the unit speed curve with «(s) = 1/(1 + s*). What is this curve? Can you 
prove it? 


Now let’s write a procedure to take an input curvature and output a curve with that curvature. 
Additional inputs give bounds on the curve’s parameter and viewing region. In particular, the 
plotting option “view=[c..d, f..g]” tells Maple to display only the region from c to d along the 
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Figure 1.31. Curve with x(s) =s 


x-axis and from f to g along the y-axis. The inputs “a” and “b” tell “odeplot” how the parameter 
“s” should vary. 


> recreate:=proc(kap,a,b,c,d,f,g) 

local sys,x,y,p,theta,pl; 

sys:=diff(theta(s),s)=kap(s), diff(x(s),s)=cos(theta(s)), 
diff (y(s) ,s)=sin(theta(s)); 
p:=dsolve({sys,theta(0)=0,x(0)=0,y(0)=0},{theta(s) ,x(s), 
y(s)},type=numeric): 

pl:=odeplot (p, [x(s) ,y(s)],a..b,numpoints=400,thickness=1, 
axes=framed,color=red): 

display (pl,view=[c..d,f..g] ,scaling=constrained) ; 

end: 


To specify a curvature function, give the parameter (f say) and use the hyphen and greater-than 
sign to construct an arrow -> to the formula defining the effect on t. Some examples of curvature 
functions are as follows. 


> kapl:=t->t; 
kap! i=t-t 


> kap2:=t->t72: 

> kap3:=t->exp(t): 

> kap4:=t->sin(t): 

> kap5:=t->sin(t)*t: 

> kap6:=t->sin(t)*t72: 
> kap7:=t->1/(1+t72): 
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Figure 1.32. Curve with x(t) = t sin(t) 


Now let’s apply the procedure to some example curvature functions. For the first, see Fig- 
ure 1.32. 


> recreate(kap5,-8,8,-2,2,0,3); 
> recreate(kap7,-10,10,-4,4,0,10); 


Exercise 1.7.8. Recreate curves with the curvature functions listed above. Also try “recreate(1,- 
5,5,-1.5,1.5,0,2);”. What should you get? 


We can also recreate curves in space from their curvature and torsion via the Frenet (differ- 
ential) equations. This then is a heuristic proof of the Fundamental Theorem of Space Curves 
(Exercise 1.5.17): curvature and torsion determine a curve up to a rigid motion in R?. The 
following procedure solves the nine Frenet equations together with the three equations defining 
the unit tangent vector of the curve and allows you to zoom in by defining a viewing region. The 
inputs “kap” and “ta” are the desired curvature and torsion and “‘a”,“‘b” bound the parameter of the 
solution curve while “c,d,e,f,g,h” give ranges for viewing along the x, y and z axes respectively. 
Finally, “theta” and “phi” give the orientation of the resulting picture. There is one thing to note 
however. We can input curvature (and torsion) functions which take both positive and nega- 
tive values to recreate curves while our formula for curvature always gives a positive function. 
Therefore, we must be careful in trying to make these procedures “inverses” of each other. 


> recreate3dview:=proc(kap,ta,a,b,c,d,e, f,g,h,theta, phi) 
local sys,p,x,y,Z,11,T2,T3,N1,N2,N3,B1,B2,B3,p1; 
sys:=diff(x(s),s)=T1(s) ,diff(y(s) ,s)=T2(s) ,diff(z(s),s)= 
T3(s) ,diff(T1(s) ,s)=kap(s)*N1(s) ,diff(T2(s) ,s)=kap(s)*N2(s), 
diff (T3(s) ,s)=kap(s)*N3(s), diff(N1(s) ,s)=-kap(s)*T1(s)+ 
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Figure 1.33. Recreated helix 


ta(s)*B1(s) ,diff(N2(s) ,s)=-kap(s)*T2(s)+ta(s)*B2(s), 
diff (N3(s) ,s)=-kap(s)*T3(s)+ta(s)*B3(s) ,diff(B1(s),s)= 
-ta(s)*N1(s) ,diff(B2(s) ,s)=-ta(s) *N2(s) ,diff (B3(s) ,s)= 
-ta(s) *N3(s) ; 

p:=dsolve({sys, x(0)=0,y(0)=0,z(0)=0,T1(0)=1,T2(0)=0, 
T3(0)=0,N1(0)=0,N2(0)=1,N3(0)=0,B1(0)=0,B2(0)=0, 
B3(0)=1},{x(s) ,y(s) ,z(s) ,T1(s) ,T2(s) ,T3(s) ,N1(s), 
N2(s) ,N3(s) ,B1(s) ,B2(s) ,B3(s)},type=numeric): 
pl:=odeplot (p, [x(s),y(s) ,z(s)],a..b,numpoints=200, 
thickness=1,axes=framed, color=red): 

display (pl,scaling=constrained,view=[c..d,e..f,g..h], 
orientation=[theta, phi] ); 

end: 


We can recreate a circular helix by taking x and t to be constants. For example, take « = 1/5 
and t = 1/10 with the solution curve parameter ranging from 0 to 200 to get a standard helix. 
See Figure 1.33. 


> recreate3dview(1/5,1/10,0,200,0,25, 0,20,0,40,49,49); 


Exercise 1.7.9. Use recreate3dview to recreate curves as follows (see Figure 1.34, Figure 1.35 
and Figure |.36). 


> kap3d1:=t->t; 
kap3dIl :=t—>t 


> recreate3dview(kap3d1,1/10,0,9,0.4,1.4,0.4,1.4,0.1, 
0.4,56,66); 


> tau3d1:=t->sin(t); 
tau3d] := sin 
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Figure 1.34. Recreated curve from x =f 
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Figure 1.35. Recreated curve from « = ¢ and t = sin(t) 


Figure 1.36. Recreated curve from x = cos(t) and t = sin(r) 
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> recreate3dview(kap3d1,tau3d1,0,7.8,0,1.5,0,1,0,1.2, 
73,75); 


> kap3d2:=t->cos(t) ; 
kap3d2 := cos 


> recreate3dview(kap3d2,tau3d1,0,60,-3,3,-2,4,-3.5,2, 
100,95); 


Exercise 1.7.10. Recreate the rectifying curves of Theorem 1.5.11. Hint: take, say, « = s* +25 
andt=s. 


We have seen in the re-creation of curves above how Maple’s “dsolve” command can be 
used to numerically solve differential equations. In fact, however, “dsolve” can often provide 
explicit solutions also. Let’s look at an example where we see how the Laplace transform can 
be used to determine the tautochrone of Exercise 1.1.14, the curve having the property that a 
bead sliding (without friction) to the bottom arrives in the same time no matter where it starts 
on the curve. Write the curve as x = x(y) since the potential due to gravity involves y. Suppose 
that the bead begins at a point (x, y;) on the curve and the bottom is at (0, 0). Conservation 
of energy says that we have mg y, = mvu”/2 + mg y at any point (x, y) along the slide, so the 
bead’s speed is given by v = \/2 g(y; — y). The arclength along an infinitesimal portion of the 


curve is given by ds = ,/ dey + 1dy and D = RT, so the total time it takes to the bottom is 


given by T = es ey + 1/,/2g(y; — y)dy. We want this to be a constant independent of y,. 


Write 


Gy 


Vist = [ =<); 


fOn) = 1/./1 and h(y1) = (& a + 1. This means that the integral is a convolution integral 
of f andh, so. /2gT = eric * h(y), where we have switched the variable to y for conve- 
nience. We will now apply the Laplace transform to both sides, ending up with the transform 
of h(y) alone. We will then use the inverse transform to find a condition on k which will deter- 
mine the curve x = x(y). The necessary Maple package is “inttrans” which stands for “integral 
transform”. 


> with(inttrans): 


Sometimes we have to tell Maple certain things to assume about a problem. Here is how to do 
that. 


> assume(T,constant); additionally(g,constant) ; 
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> laplace(sqrt(2*g)*T,y,s); 


V2 /gT 


> f£:=1/sqrt(y); 


It is fortunate that Maple knows how to take this Laplace transform, because it very difficult by 
hand. Indeed, it involves the I function and its properties. 


> laplace(f,y,s); 


“8 
AY 


Since ./2 g L(T) = —L(f) L(A), we can find L(A) by dividing as follows. 


> Lh:= simplify(laplace(sqrt(2*g)*T,y,s)/laplace(f,y,s), 
symbolic) ; 


_ v2 VET 
fas 


Therefore, h(y) can be found using the inverse laplace transform. 


Lh 


> h:=simplify(invlaplace(Lh,s,y) ,symbolic) ; 
pw V2NBT 
x Jy 


But we know that h(y) = J(eY + 1, so this gives us a differential equation for x = x(y). Here 
is how to solve for x = x(y). 


> solve(sqrt (diff (x(y),y)"2+1)= 
2° (1/2) *g* (1/2) *T/Pi/y* (1/2) ,diff(x(y),y)); 


—y@y—2e9T?)  J-y@? y—28T?) 
my my 


> simplify (1/Pi/y*(-y*(Pi*2*y-2*g+*T~2))* (1/2), symbolic); 
Jr2y—-2gT?1 
a Jy 


Note that Maple simplifies by introducing the complex number J = ./—1. This is just notation 
however. We now use “dsolve” to solve the differential equation. 
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> dsolve(diff (x(y),y)=1/Pi/sqrt(y)* 
(-(Pi*2*y-2*g*T~2))* (1/2) ,x(y)); 


V¥—-m?y+2gT? fy 
n 


x(y) = 


ee 
1 rao 
V—m? y?+2ygT? 


te FH Cl 


nm? /—-n?y+2gT? Sy 


This doesn’t give much information, so we substitute cleverly to see what is going on. 


eT? /(-m? y+2g¢T?)y- arctan 


> x(y)=Int(1/Pi/y*(1/2)* (2Q*g*T°2-Pi-2*y)* (1/2) ,y=0..y1); 


i Von y+2gT? oe ay 


x(y) = 


In order to change variables in an integral easily, we need to use the “student” package. 


> with(student): 


> x(y)=changevar (y=2*g*T*2/Pi*2*sin(w) “2, 
Int (1/Pi/y* (1/2) *(2*g*T°2-Pi~2*y)~ (1/2) ,y=0..y1),w); 


x(y) = 
2% Te 
aresin 2 
| 2JV2 /—2¢T? sin(wy +229 T? gT? sin(w) cos(w) ? 
0 gT? sin(w)* 
nz 


Let’s try to simplify the integrand. 


> simplify(2/Pi~3*2° (1/2) /(g*T"2/Pi°2* sin(w)~2)7(1/2)* 
(-24g4T"24sin(w) “2+2*g*T~2) ~ (1/2) *g*T° 2*sin(w) *cos(w) , trig) ; 


4/2 T? cos(w) g T? sin(w) cos(w) 
nm? ./—g T? (-1 + cos(w)?) 


Now, Maple is never perfect at simplifying. As humans, we can see that certain trigonometric 
identities hold, but Maple doesn’t always recognize them. We also note that cos(w) is positive 
here, but Maple won’t take the square root without us telling it to do so explicitly. In practice, we 


1.7. The Geometry of Curves and Maple 61 
must use our own skills with Maple’s to achieve the best results in simplification. 


> xy)=int (4¥*g*T*2*cos(w) *2/Pi72,w) ; 


1 
42¢T? | —cos(w)sin(w) + a 
iz 2 2 
x(y) = = wee Se 
> x(y)=combine (4¥*g*T*2/Pi72* (1/2*cos(w)*sin(w)+1/2*w) ,w) ; 
gT*sin(2w)+2gT? w 
a 


Therefore, we have a parametric description of the curve x = x(y) in terms of the parameter 
w (recall that y = 2g 7? sin(w)*/x? and that the half angle formula simplifies this to y = 
gT* (i —cos(2 w))/x?. Substitute 2 w = 6. 


> xtheta:=(g*T°2/Pi~2)*(sin(theta)+theta) ; 
ytheta:=(g*T*2/Pi~2)*(1-cos(theta)); 


_ gT*(sin() + 9) 
xtheta := a, 
_ gT? (1 —cos(6)) 
ytheta := poe aaa 


This is a cycloid! So Maple (with a little human intervention) discovers the tautochrone to be a 
cycloid. Let’s forget about the constant factors and plot the parametric curve in Figure 1.37 and 
Figure 1.38. 


> plot([theta+sin(theta) ,1-cos(theta), theta=0..4*Pi], 
scaling=constrained,view=[0..12,0..2] ,color=blue) ; 


> plot({thetatsin(theta) ,1-cos(theta), theta=Pi..2*Pil, 
scaling=constrained,view=[3..6.3,0..2],color=magenta, 
ytickmarks=4) ; 


Now let’s change course a bit by plotting and analyzing Viviani’s curve from 
Exercise 1.3.26. 


> viv:=<a*(1t+cos(t)) |a*sin(t) |a*2*sin(t/2)>; 


viv = le (1 4+ cos(#)), asin(t), 2asin (5) 


2 4 6 8 10 12 


Figure 1.37. The tautochrone 
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Figure 1.38. The tautochrone: a closer view 


We can plot the curve in two ways. Again, we can save plots and display them all at once. We 
take Viviani’s curve, a sphere and an intersecting cylinder. Viviani is the intersection! 


> vivi:=spacecurve(convert(subs(a=1,viv), list) ,t=-Pi..Pi, 
color=black): 


> viv2:=tubeplot (convert (subs (a=1,viv),list), t=-2*Pi..2*Pi, 
radius=0.05,color=black): 


> sphere:=plot3d([2*cos(u)*cos(v), 2*sin(u)*cos(v), 
2*sin(v)] ,u=0..2*Pi,v=-Pi/2..Pi/2,shading=XY, lightmodel 
=light2): 


> cyl:=plot3d([cos(u)+1,sin(u),v], u=0..2*Pi,v=-2..2, 
shading=Z,lightmodel=light1): 


Now we display the plots. Note that we use viv2 because “tubeplot” allows us to see the curve 
of intersection better. See Figure 1.39. 


> display({viv2,sphere,cyl },scaling=constrained, 
orientation= [-27,69]); 


Now let’s verify that Viviani’s curve is a spherical curve. We use the criterion for spherical 
curves given in Exercise |.3.23—Exercise 1.3.26. 


> kk:=curv(viv) ;tt:=tor(viv) ; 
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Figure 1,39. Viviani's curve as the intersection of sphere and cylinder 


We now check the criterion. We need the following to be a constant, 


> zzi=simplify(1/kk*2+(1/tt~2)* diff (1/kk,t)~2/Norm( 
nap(diff,viv,t) ,Euclidean,conjugate=false)~2,symbolic) ; 


9 
zzt=4a° 


Hence, Viviani’s curve ts truly spherical. 


Exercise 1.7.11. Show that the following curve is spherical. 


(a cos*(r), a sin(r) cos(r), a sin(r)) 


63 


In Exercise 1.3.15, curves of constant precession were discussed. The recreate procedures 
above may be used to create curves of constant precession by choosing «(s) = —a sin(bs) and 
t(s) = a cos(bs) for constants a > 0 and b. However, in [Sco95], an explicit parametrization for 
such curves is derived. Furthermore, it is shown that any such curve lies on a hyperboloid of one 


sheet (which we shall consider several times in this book) with the formula 


The explicit parametrization is given by 


. i 
i aire t= at 


2c(¢ — b) 2e(¢ + b) 
c+hb c—b 
vs) = *Snc=) cos({c — b]s) + ele +b) cos([c + b]s) 


a ie 
z(s) = — sin(bs) 
be 
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Figure 1.40. Constant precession curve with a = 4 and b = 3 


where c = Va* + b*. The following procedure graphs curves of constant precession on the 


associated hyperboloids. 


> constprec:=proc(a,b) 


local c,alpha,al,a2,hyp,pli,pl2; 


c:=sqrt (a7 2+b~2) ; 
al:=ctb; 
a2:=c-b; 


alpha:=[a1/(2*c*a2) *sin(a2*s)-a2/(2*c*al)*sin(al*s), 


-al/ (2*c*a2) *cos(a2*s)+a2/(2*c*al)*cos(al*s), 


a/(b*c) *sin(b*s)]; 


hyp: =([2*b/a~2*cosh(r) *cos(q) ,2*b/a~2*cosh(r)*sin(q) , 


2/a*sinh(r)]; 


pli:=tubeplot (alpha, s=0..2*Pi,radius=0.01,color=black, 


numpoints=200) ; 
pl2:=plot3d(hyp,r=-1..1 


»q=0..2*Pi) ; 


display ({pl1,p12},scaling=constrained,style=wireframe, 


shading=XyY) ; 
end: 


Let’s take a curve of constant precession with a = 4 and b = 3. See Figure 1.40. 


> constprec(4,3); 


Exercise 1.7.12. Graph the curves of constant precession with a = 15, b = 8 and witha = /6, 
b = ./2. Can you make a conjecture about when curves of constant precession are closed curves? 


See [Sco95]. 


Exercise 1.7.13. Graph the unit tangent vector T(s) (i.e., the tantrix or tangent indicatrix) for 
curves of constant precession with a = 15, b = 8 and witha = /6, b = J/2. Use “display” to 
graph T(s) on the unit sphere. Can you make a conjecture about the number of cusps of such 
curves? For a discussion of tangent indicatrices of spherical curves, see [Sol96]. 
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BESS 
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Figure 1.41. New York to Moscow Figure 1.42 Cleveland to Sydney 


Exercise 1.7,14. The following procedure draws the great circle arc between two points A and B 
on Earth and gives the distance between the points (assuming the radius of the Earth to be 3970 
miles). A and B are given by longitude and latitude coordinates. For example A=[100,25] means 
that the point A has longitude 100 degrees and latitude 25 degrees. We take positive longitude to 
be east of Greenwich, England and negative longitude to be west of it. We also assume that the 
Earth is a sphere with parametrization (see Chapter 2) (R cos(u) cos(v), R sin(it) cos(v), R sin(v)), 
R being the radius of the Earth. 

Explain every line of the procedure. This should be a good test of the vector calculus used in 


this chapter. 


» with(LinearAlgebra) :with(plots): 


> great_circle;=proc(placel ,place2,theta, phi) 

local pointi,point2,angle,A,B,sphere,circ,alpha,wnitcp, 
alpi,alp2; 

A:=map(evalf, [Pi/180*place1[1] ,Pi/180*place1[2]]); 
B:=map(evalf, [Pi/180*place2[1] ,Pi/180*place2[2]]); 
point1:=<cos(A[1])*cos(A[2])|sin(A[1])*cos(A[2])|sin(aA[2])>; 
point2:=<cos(B[1])*cos(B[2]) |sin(B[1] )*cos(B[2]) |sin(B[2])>; 
angle:=evalf (arccos(DotProduct (point1, point2, conjugate= 
false))); 

print(‘Distance is‘ ,3970*angle, ‘miles‘); 
unitcp:=1/sin(angle)*CrossProduct (point1,point2) ; 
alp1:=ScalarMultiply(pointi,cos(t)); 
alp2:=ScalarMultiply(CrossProduct (unitcp,point1),sin(t)); 
alpha:=alpitalp2; 

sphere :=plot3d([cos(u)*cos(v) ,sin(u)*cos(v),sin(v)],u=0.. 
2*Pi,v=-Pi/2.-Pi/2, shading=XY, lightmodel=light2, grid= 
(25,191): 

circ:=tubeplot (convert (alpha, list) ,t=0..angle,radius=0.01, 
color=navy): 
display({sphere,circ},scaling=constrained, orientation= 
[theta,phi]); 

end: 
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Here are some longitudes and latitudes (in that order): Cleveland (-81.68,41.48), New York 
(-73.94, 40.67), Chicago (-87.68, 41.84), Paris (2.30, 48.83), Moscow (37.62, 55.75), Atlanta 
(-84.42, 33.76), Toronto (-79.60, 43.67), Seoul (127.05,37.52), Beijing (116.35,39.90), Sydney 
(151.30, -33.90), New Delhi (77.22, 28.90), Rio de Janeiro (-42.70, -22.49). How far is it from 
New York to Moscow? London to Paris? Cleveland to Sydney? See Figure 1.4] and Figure 1.42. 


> great_circle([-73.94,40.67] , [37.62,55.75], -21,60); 
Distance is, 4680.738234, miles 

> great_circle([-81.68,41.48] ,[151.30,-33.9], -156,75); 
Distance is, 9566.027530, miles 


Surfaces 


2.1 Introduction 


We see examples of surfaces in everyday life. Balloons, inner tubes, cans and soap films provide 
physical models of surfaces. In order to study the geometry of these objects, what is required? 
Somehow, we must have coordinates to make calculations. But just because these surfaces reside 
in R? we shouldn’t let ourselves think of them as 3-dimensional. For instance, if we cut a cylinder 
lengthwise, then we can unroll it to lie flat on a tabletop. This indicates that surfaces are inherently 
2-dimensional objects and, therefore, should be describable by two coordinates. This gives us 
our first clue to an approach for describing the geometry of surfaces. Namely, we should try to 
spread part of the plane around a surface and, in terms of the required twisting and stretching, 
understand how the surface curves in space. The spreading provides coordinates to do calculus on 
a surface and various calculus-based quantities associated with the spreading describe the shape 
of the surface. Just as the calculus of the Frenet Formulas enabled us to describe the geometry of 
a curve, so also will the calculus of a surface enable us to describe the geometry of a surface. 

Let D denote an open set in the plane R*. The open set D will typically be an open disk or an 
open rectangle. Let 


x:D>R (u,v) + (x!(u, v), x?7(u, v), x3(u, v)) 


denote a mapping of D into 3-space. The x'(u, v) are the component functions of the mapping 
x. We can do calculus one variable at a time on x by partial differentiation. Fix v = vo and let u 
vary. Then x(u, vo) depends on one parameter and is, therefore, a curve. It is called a u-parameter 
curve. Similarly, if we fix u = uo, then the curve x(uo, v) is a v-parameter curve. Note that both 
curves pass through x(uo, vo) in R>. Tangent vectors for the u-parameter and v-parameter curves 
are given by differentiating the component functions of x with respect to u and v respectively. 


We write 
ax! ax? ax ax! ax? ax} 
x, ={—, —, — and x,» ={—, —, — ]}. 
du Oud dv. dv av 
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Figure 2.1. Parameter curves with tangent vectors 


2. Surfaces 


We can evaluate these partial derivatives at (uo, vo) to obtain the tangent, or velocity, vectors of 
the parameter curves at that point, x,(uo, Uo) and x,(uo, Uo) (see Figure 2.1). 


Of course, to obtain true coordinates on a surface, we need two properties. The first is that x 


must be one-to-one. In fact, as we will see later, we can relax this condition slightly to allow for 
certain self-intersections of a surface. Namely, we will allow those intersections which still give 
unambiguous normal vectors. Second, x must not “crinkle” parameter curves together so that x, 
and x, have the same direction. For, then, we lose 2-dimensionality — the very attribute of a 
surface we are trying to describe. The first property is verified by simple algebra or imposed by 
decreasing the size of D. But how can we show that the vectors x, and x, are never linearly 
dependent so that the second property holds? The following exercise provides a simple test. 


Exercises 2.1.1. Show that x, and x, are linearly dependent if and only if x, x x, = 0. Recall 
that two vectors are linearly dependent if one is a scalar multiple of the other. 


A mapping x: D > R? from an open set D C R? is regular if x, x x, # 0 for all points in D. 


(Of course, x, x X, must exist as well!) A coordinate patch (or parametrization) is a one-to-one 
regular mapping x: D — R? of an open set D C R? to R?. A surface in R? isa subset M C R? 
such that each point of M has a neighborhood (in M) contained in the image of some coordinate 
patchx: D> MCR’. 


Because each patch (e.g.,x, y, zZ) ona surface is one-to-one (at least in a smaller open portion 


of the original domain), then the respective inverse functions exist (and, in fact, are continuous). 
We may therefore consider the composition of any pair of patches x! o y: D > R*. Sucha map 


is differentiable (or smooth) if all partial derivatives 2 


2 # 8 
au? du2? du? 


- of each component function 


exist and are continuous. A surface is differentiable (or smooth) if x—! oy is differentiable for any 
pair of patches on the surface. The surfaces we deal with will generally be smooth or close to it. 
That is, there may be a finite number of points which must be discarded to achieve smoothness. 
In virtually all of our examples, a surface M will be defined by a single patch with the exception 
of only a finite number of points. Indeed, we mention the notion of overlapping patches only 


because we use the idea later in our discussion of isometries and Gauss curvature. 
A function f: M — R from a surface M is differentiable (or smooth) if the composition 


fox: D — R is smooth for each patch x on M. A curve on a surface is simply a mapping 
from an interval of real numbers J = [a, b] C R to the surface M,a: I — M. The surface M is 
said to be path connected if, for any two points p, q € M, there is acurve a: [0,1] — M with 
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a(0) = p and a(1) = q. In this book ail surfaces, unless explicitly mentioned otherwise, will be 
assumed to be path connected. This also implies the weaker property that surfaces are connected; 
that is, a surface M can be written as a disjoint union of open sets M = A U B only when either 
A or B is empty. 


Exercise 2.1.2. This exercise is for those with some knowledge of topology. It will be used in 
Theorem 6.7.7. Suppose M is connected. Show that a subset Z C M which is both open and 
closed must be either M or @. 


A curve is differentiable (or smooth) ifx~! 0 a: I N a~'(x(D)) > R? is smooth for all patches 
x on M. The following lemma shows one reason why smoothness of surfaces is important for us. 
Namely, in order to understand curves on surfaces, it will be sufficient to understand parametric 
curves back in the open domains of R? rather than in the surfaces themselves. 


Lemma 2.1.3 (Important Lemma). Let M be a surface. If a: 1 > x(D) CM is a curve in 
R? which is contained in the image of a patch x on M, then for unique smooth functions 
u(t), u(t): ! > R, 


a(t) = x(u(t), v(t). 


Proof. Because a is smooth, the composition x7! 


D C R?,so x7! oa(t) = (u(t), v(t)). Hence 


a(t) = x(x7! o a(t)) = x(u(2), v(t). 


In order to see that u(t) and v(t) are unique, suppose a = x(u(t), v(t)) for two other functions i 
and v. Note that we can assume that u and @ are defined on / as well by reparametrizing. Then 


oa: I > D is smooth by definition. Now, 


(u(t), v(t)) = x7! oa(t) = x7! o x(a(t), T(t) = (A(t), (1)). 
O 


To study a curve on a smooth surface now, we can look at the functions of one variable u(t) 
and u(t). Also, note that, if a curve @ on M is not contained in a single patch, then we may 
study it by considering the parts of it which lie in separate patches and then piece the information 
together. Again we note that our examples shall deal only with curves contained in a single patch. 
We are sometimes interested in closed curves on surfaces. These are curves a: [a,b] > M with 
a(a) = a(b) and a(a) = a”(b) for all order derivatives. This is equivalent to saying that the 
curve may be represented by a smooth map of the circle S$! into M. This leads to the following. 

Suppose M: x(u,v) and N: y(r,s) are two surfaces defined by single patches. A mapping 
F: M — N simply associates a point of N, unambiguously, to a point of M. We write F(p) = q 
for p € M and q € N. A mapping F is differentiable (or smooth) if the composition y~! o F o 
x: Dy — Dy is smooth as a map of open subsets of R?. That is, the usual partial derivatives must 
all exist and be continuous. Of course, we may make the same definition for surfaces covered by 
many patches. We simply require that all such compositions are smooth. Mappings of surfaces 
may be quite complicated as we shall see later. We shall also see, however, that linear algebra 
allows us to get a reasonable linear approximation of the mapping that is quite understandable. 


70 2. Surfaces 


Before we give examples of patches, we need to introduce one last preliminary notion which 
often allows us to prove things. A surface M C R? is compact if it is closed and bounded. The 
meaning of “bounded” is clear; namely, M is bounded if it may be completely enclosed by a large 
enough sphere in R?. A subset M C R? is closed if any convergent sequence x; > x in R? with 
each x; € M must also have ¥ € M. Below, we shall see examples of compact and non-compact 
surfaces. The most important properties related to compactness are the following. 


Proposition 2.1.4. We have the following properties: 


(1) If F: M > N is a map of surfaces with M compact, then F(M) is compact in N as well. 


(2) If f: M > R is a smooth function and M is compact, then f attains its maximum and 
minimum at some points of M. 


Examples of Patches (or Parametrizations) on Surfaces 


Example 2.1.5 (The Monge Patch). 
The graph of a real-valued function of two variables z = f(x, y) is a surface in R>. To see this, 
define a patch by 

x(u, v) = (u, v, f(u, v)) 


where u and v range over the domain of f. Then, x, = (1, 0, of) and x, = (0, 1 
on, when convenient, we shall denote partial derivatives of functions by 


of 
, 3,): From now 


———=-fi, hee 


The patch is regular, as can be seen from 


ij k 
wXxX=/1 0 ffJ=-—h fh, 1 #9. 
01 ft 


As an example, consider the paraboloid z = x* + y*. A Monge patch (see Figure 2.2 and 
compare Figure 2.3) for this non-compact surface is given by 


x(u, v) = (u, v, wt v’). 
Exercises 2.1.6. What are the parameter curves for the paraboloid when uy = 0, vp = 0? 
Exercise 2.1.7. Find a patch for the cone z = ,/x? + y?. Does something go wrong here? 


Exercises 2.1.8. Find a Monge patch on (part of) the unit sphere x? + y? + z* = 1. Why can’t 
a Monge patch be defined on all of the sphere? 


Example 2.1.9 (Geographical Coordinates). 
Let M be a sphere of radius R (centered at (0, 0, 0) for convenience). Geographical coordinates 
make use of two angles and a radius from the center (which in our case is fixed at R). For a point 
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Figure 2.2. A (Monge) patch Figure 2.3 A (non-Monge) paraboloid 


a distance R away from the origin, draw a line segment from (0, 0, 0) to the point. The parameter 
u, which varies as —7/2 < uv < 7/2, measures the angle (in radians) from this line down to the 
xy-plane. Ifthe line is projected onto the xy-plane, then w (with 0 < u < 277) measures the angle 
of the projection from the positive x-axis. See Figure 2.4. Consider the following triangles (see 
Figure 2,5). Hence, the xyz-coordinates of the point are 

(Rceosucosv. Rsinucosv. Rsinv) = x(u. v) 
with 

x, =(—Rsinucosv, Reosucosv, 0), 

x, =(—Reosusinv, —Rsinusinv, Reos v), 


Further. we compute the cross product to be 


? , ae ? P 
X, XX» =(R* cosu cos? v. R? sinucos* v, R® sinv cos v). 


Figure 2.4. Geographical coordinates 
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R R cos (v) 
R sin (v) 


R cos (v) R cos (v) sin (wz) 


Figure 2.5. Projection triangles 


where, in the 3 coordinate, we use sin? u + cos? u = 1. Note that |x, x x,| = R? cos v. From 
now on, we shall use the notation S?(R) for the sphere of radius R centered at the origin and 
simply S? for the sphere of radius 1 centered at the origin. Note that spheres are compact. 


Example 2.1.10 (Surfaces of Revolution). 

Suppose C is a curve in the xy-plane and is parametrized by a(u) = (g(u), h(u), 0). Revolve C 
about the x-axis (see Figure 2.6). The coordinates of a typical point P may be found as follows. 
The x-coordinate is that of the curve itself since we rotate about the x-axis. If v denotes the angle 
of rotation from the xy-plane, then the y-coordinate is shortened to y cos v = h(u) cos v and the 
z-coordinate increases to h(u) sin v. See Figure 2.7. Hence, a patch may be defined by: 


X(u, v) = (g(u), h(u)cosv, h(u)sinv). 


ee 
ALA 
ae ae 


Figure 2.6. A surface of revolution 


A(u) sin (v) Aw) 


h(u) cos (v) 


Figure 2.7. Surface of revolution triangle 
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If we revolve about a different axis, then the coordinates permute. For example, a parametrized 
curve (/h(u), 0, g()) in the xz-plane rotated about the z-axis gives a surface of revolution 
(A(w) cos v, A(u)sinv, g(u)). 


Exercise 2.1.11. Check that 


Why is x, = x, #0 forall u, v? 


In general, g(u) measures the distance along the axis of revolution and h(w) measures the distance 
from the axis of revolution. The nicest situation is, of course, when g(u7) = x, for then it is easy 
to see where you are on the surface in terms of the parameter u. 


Exercise 2.1.12. Find a patch for the catenoid obtained by revolving the catenary y = cosh(.x) 
about the x-axis. See Figure 2.8. 


Figure 2.8. A catenoid 


Exercise 2.1.13 (The Torus), Consider the diagram of the torus, Figure 2.9. Revolve the circle 
of radius r about the z-axis. Show that a patch is given by 


x(a, v) = ((R+rcosu)cosu, (R+rcosu)sinu, rsinu). 


Figure 2.9. A torus of revalution 
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Figure 2.10. A helicoid Figure 2.11 Enneper’s surface 


How do u and v vary? Note that the torus is compact. 


Example 2.1.14 (The Helicoid), 

Take a helix a(u) =(acosu, asinu, bu) and draw a line through (0,0, bu) and (acosu, 
asinu, bu). The surface swept out by this rising, rotating line is a helicoid. The line required is 
given by (0. 0, bu) + v(acosu, a sinu, 0), so.a patch for the helicoid (see Figure 2.10) is given 
by 


x(u, Uv) =(avcosu,ausing, bu). 
Exercise 2.1.15. Show that x(u, v) is a patch by verifying that |x, * x,| + 0. 


Exercise 2.1.16 (Enneper’s Surface). Define a surface by 


2 yp 
2 ? ? 
x(u,v) = (« = ry +uuy, v— 3 + vu, i= ’) . 


Show that this definition gives a regular patch (see Figure 2.11). Then show that, for u? + 
v? <3, Enneper’s surface has no self-intersections. Hint: use polar coordinates u« = r cos@, 
v =rsin@, show that the equality x? + y? + 32 = 4r?(3+r7) holds, where x, y and z are 
the coordinate functions of Enneper’s surface, and then show that the equality implies that 
points in the (uv, v)-plane on different circles about (0,0) cannot be mapped to the same point. 
Finally, find two points on the circle u* + v7 =3 which do map to the same point of the 


surface. 
A surface is ruled if it has a parametrization 


x(u, v) = B(u) + vd(u) 
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where f and é are curves. That is, the entire surface is covered by this one patch which consists of 
lines emanating from a curve 6(u) going in the direction 5(u). The curve A(u) is called the directrix 
of the surface and and a line having 4(™) as direction vector is called a ruling. A surface is doubly 
ruled if it has two different ruled patches x(u. v) = A(u) + vd(u) and y(u, Vv) = &(u) + vy(u). 
Ruled surfaces may have points such that x, x x, = B’(u) x 5(u) + vd"(u) x 8(u) = 0. These 
points are well-controlled, as Exercise 2.1.24 below shows. We have two very well known 
examples of ruled surfaces. 


Example 2.1.17 (Cones). 
x(U, V) = p + vd(u) where p is a fixed point. 


Example 2.1.18 (Cylinders). 
x(u, v) = B(u) + vg where q is a fixed direction vector. 


Exercise* 2.1.19, Find patches for the standard cone z = ,/ x? + y? and standard cylinder x? + 
y* =1 which are ruling patches in the sense of Example 2.1.17 and Example 2.1.18 above, 


This explains why the names cone and cylinder are used for the more general patches given 
above. 


Exercise 2.1.20. Show that the saddle surface z = xy is doubly ruled. See Figure 2.12 
Exercise 2.1.21. Show that the helicoid is a ruled surface. 


Exercise* 2,1,22. The hyperboloid of one sheet (see Figure 2.13) 


x “ yo iz 
a Be 


=) 


has a useful parametrization 


x(u, v) = (acoshucosv, bcoshwsinv, csinhu), 


Figure 2.12. A saddle Figure 2.13 A hyperboloid of one sheet 
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However, this is not a ruling patch. Find two (related) ruling patches for this surface. That 
is, show the surface is doubly ruled. Hint: A directrix is B(u) =(acosu, bsinu, 0). Let 
5(u) = B’(u) + (0, 0, c). Because they are ruled surfaces, hyperboloids of one sheet are use- 
ful in engineering. For example, one reason nuclear cooling towers are built in the shape of 
hyperboloids of one sheet is that they may be built with straight beams along the rulings (see 
[Whi84]). Therefore, stress on the tower is small (i.e., there is no internal bending) and the tower 
is easy to build. What do you think another reason is for using such shapes? “Venture” a guess. 
Also, as another example, look at the weave in the base of a wicker chair with a seashell back. 
Why would a ruling patch be used there? 


Exercise 2.1.23. Show that another patch for the hyperboloid of one sheet is given by 


- 1 — | 
xu.) = (a v b +uv ot ): 


utv’ utv’ utuv 


In particular, take a = b = c = | corresponding to x? + y? — z* = | and try plotting this para- 
metrization with Maple to see what problems you can run into. 


Exercise 2.1.24. Suppose M: x(u, v) = B(u) + vd(u) is a ruled surface with |6’| = 1 and [6] = 
1. Also suppose 8’ # 0, so that M is non-cylindrical. Show that M may be reparametrized by 
y(u, w) = y(u) + wd(u), where y’ - 5’ = 0. Note that y may not be unit speed. A curve such as 
y is called a line of striction for M. Show that any point on M where x, x x, = 0 must lie on 
the line of striction. Hint: write y(u) = B(u) + r(u)S(u), use y’ - 5’ = 0 and solve for r(u). Let 
w=u-—r(u). 


Exercise 2.1.25. Find the lines of striction for the helicoid and the hyperboloid of one sheet. For 
the latter, assume for convenience thata =b=c= 1. 


Exercise 2.1.26. Suppose M: x(u, v) = B(u) + vd(u) is a ruled surface. Show that M may be 
reparametrized by y(u, w) = €(u) + wd(u), wheree’ - 5 = 0. Hint: write e(u) = B(u) + s(u)d(u), 
use €’-5 =0 and solve for s(u). Your answer may be an integral. We will use this result 
in the proof of Catalan’s Theorem later. Find such a parametrization for a cylinder x(u, v) = 
B(u) + vq. 


The patches defined in this section provide a variety of examples of surfaces on which we will 
later test our notion of curvature (and its effect on geometry). We must note several important 
points, however, before we go on. 

(1) Occasionally our patches miss one or several points on a surface. This defect in a particular 
defining patch usually means that a single patch is not really sufficient to completely define the 
surface. In general, many patches may be required to cover a surface. Ask why, for example, in 
our patch on the sphere, the North and South poles are missing. 

(2) Other easily checked conditions also give us surfaces. For example, if we have a level 
set g(x, y,z) =c, then the Implicit (or Inverse) Function Theorem (IFT) may be invoked to 
show that the nonvanishing of the gradient of g, Vg = ( a ze, 38), at all points of the 
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level set, ensures that the level set is a surface. Essentially the IFT constructs a patch, so, in 
some sense, what we have done is more general. Furthermore, the “patch approach” allows 
much cleaner calculations later on. Thus, we have emphasized patches over other conditions for 
surfaces. 


Exercise 2.1.27 (Foreshadowing Exercise). In the plane our standard coordinate system uses 
the vectors e; = (1,0) and e2 = (0, 1) as x, and x, since the parameter curves are horizontal 
and vertical lines respectively. We have e; -e; = 1, e; -€2 = 0 and ep - e2 = 1. Use the notation 
E =x, -x,, F =x, -+x,,G =x, -x, and compute E, F, and G for all the patches given in this 
section. What is the same and what is different when a specific patch is compared to the plane 
itself? In particular, what does F tell you? 


2.2 The Geometry of Surfaces 


Now that we have an idea of what a surface is, how do we detect its geometry? One of the most 
important techniques in mathematics and, indeed, all of the natural sciences, is that of linear 
approximation. By this we mean the following. We recognize that the nonlinear or curved object 
at hand is too complicated to study directly, so we approximate it by something linear: a line, 
a plane, a Euclidean space. We then study the linear object and, from it, infer results about the 
original curved object. Of course this is exactly what we do when we attach the linear Frenet 
frame to a curve. This process also is useful in subjects ranging from differential equations to 
algebraic topology. 

The question is, of course, what type of linear space may be used to approximate a surface? 
Just as, in one-variable calculus, we use a tangent line to approximate a curve near a point, so 
we can use a tangent plane T,(M) to approximate a surface M near p € M. A plane consists of 
vectors and the natural idea for the make-up of 7,(M) is to use the vectors which arise as velocity 
vectors of curves on M. 

Formally, say that a vector v, € T,(M) is tangent to M at p if v, is the velocity vector of 
some curve on M. That is, there is some a: J — M with a(0) = p and a’(0) = vp. Usually we 
write v instead of v, when no confusion will arise. Then, the tangent plane of M at p is defined 
to be 


T,(M) = {v | v is tangent to M at p}. 


Immediately we know two curves which pass through p = x(uo, vo): the u and v-parameter 
curves with velocity vectors x, and x,. The following result says that every tangent vector is 
made up of a (unique) linear combination of x, and x,. Hence, {x,, x,} forma basis for the vector 
space T,(M). (Recall that this is what the term “basis” means.) 


Lemma 2.2.1. v € T,(M) if and only if v =A,x, +A2X»y, where X,, Xy are evaluated at 
(uo, v9). 


Proof. First suppose that @ is a curve with a(0) = p and a’(0)=v. We saw before that 
a(t) = x(u(t), v(t)), so the chain rule gives a’ = x,(du/dt) + x,(dv/dt). Now, a(0) = p= 
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x(u(0), v(0)), so u(0) = ug and v(0) = vo (since x is one-to-one) and 


d d 
v =a'(0) = x, (uo, vo) (0) + x,(uo, vo)-(0). 


Hence, A, = (du/dt)(0) and Az = (dv/dt)(0). 
Now suppose v = A;x, + A2x, (where x, and x, are evaluated at (uo, vg)). We must find a 
curve on M with a(0) = p and «’(0) = v. Define this curve using the patch x by 


a(t) = x(up + fA1, vo + fA2). 
Then w(0) = x(uo, vp) = p and 


, d(ug +12 
a(t) = xX,(uo + tA, vo + nye) 
d(v9 + thr2) 


dt 
= X,(Uo + fA], Vo + fA2)Ay) + Xy(Uo + LAI, Vp + LA2)A2. 


+ Xy(uo + fA), Vo + fA2) 


Thus, a’(0) = x,A; + XyA2. O 


How do we identify the tangent plane via an equation? The same idea works for any plane 
in R?. A plane is determined by a point p and a normal vector (at p) to the plane N. Any q is 
in the plane exactly when g — p is perpendicular to N. We then have, N -(q — p) = 0 or, for 
q = (x, y, 2), Pp = (Xo. yo. 20) and N = (a, b,c), 

ax + by +cz—[axo + byp + czo] = 0 

ax+by+cz+d=0 
where d = —[axg + byo + czo]. This is, of course, the usual equation of a plane. In fact, we 
already know how to get a normal vector for T,(M). Since {x,, x,} forms a basis for T,(M), 
we need only find a vector perpendicular to both x, and x, (and hence to any v = A,x, + A2Xy). 


Such a vector is given by the cross product x, x x,. Therefore, if x is a patch for a surface with 
X(Uo, Uo) = p, then a normal NV for T,(M) (see Figure 2.14) is given by 


N = x,(uo, Vo) X Xy(Uo, Vo)- 
Example 2.2.2 (A Function of Two Variables). 
Let M be the graph of a function z = f(x, y). We have a Monge patch x(u, v) = (u, v, f(u, v)) 


with x, x x, =(—fa. —fi, 1). If p = (uo, vo, fo, vo)), then the equation describing 
T,(M) is 


— fu(uo, vo)(x — uo) — fo(uo, vo)(y — vo) + (2 — fo, vo)) = 9. 
For instance, if z = x? + y*, then we have, for p = (x0, yo, 5a + ya), 


—2x9(x — xo) — 2yo(y — yo) + (z — (x2 + y@)) =0 
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Figure 2.14. Surface with normal 


which reduces ta 
2xox + 2yoy —2 =x) +B. 
Specifically, for p = (1,0, 1), we have 2x — z = | as the equation of the tangent plane. 


Example 2.2.3 (The Sphere of radius R). 
Parametrize the sphere x* + y* + z? = R? by geographical coordinates, Recall that the normal 
may be written, 


? 2 7 « % « 
X, X X, = (R* cos u cos? v, R* sinu cos* v. R? sin v cos v). 


(Note that x, x x, = (R cos v) p,a multiple of the point p the sphere. This verifies that the radial 
vector from the origin to the sphere meets the sphere at an angle of 2/2.) Suppose p = (R, 0, 0), 
so that u = 0 and v = 0. Then x, x x, at p is (R?, 0, 0) and T,(M) is given by 


(R?, 0,0): (x —R,y,z) =0 
R?x = R? 
x= R, 


This is, of course, the plane which is tangent to the sphere at (&, 0, 0) 


In fact, we shall rarely be interested in the equation of the tangent plane — ir is the normal 
itself which will detect the geometry of the surface. \f we know how the direction of the normal 
to a surface changes as we move in the direction determined by a tangent vector, then we will 
know how the surface itself curves in that direction. There is a point which must be made here. 
When a surface requires more than one patch to cover it, there is the possibility that the normals 
defined from the various patches are incompatible with each other in a fundamental way. We refer 
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Figure 2.15. A Mobius strip 


the reader to [dC76, p. 102] for details. Such a surface is said to be non-orientable and the first 
example is 


Example 2.2.4 (The Mobius strip). 
Consider the regular parametrization 


x(u, v) = ((2 —v sin(5)) sinu, (2- v sin ()) cosu, Vv COs (5)) 


for0 <u <2 and —1 <v < 1. This patch omits the interval where u = 0, so another patch is 
needed. This patch may be given by y(#, 0) = 


2-—d si et 7 2—v sin ert: sin, ¥ COs are 
D sin 413 cos, v 4735 u,ove AS 


which omits % = 0 corresponding to u = 2/2. No matter how we try to arrange it, these patches 
lead to the conclusion that the unit normal (of x say) must be equal to its negative! A picture of 
the Mobius strip (Figure 2.15) indicates how a normal may reverse itself as it travels around a 
closed curve. 


We have been rather vague about this notion of orientability since it will rarely come up in the 
rest of the book. In particular, a single regular patch x is always orientable since an explicit unit 
normal 


Xy X Xy 


_ |x, X X,| 


exists at each point. (Note that switching the order of x, and x, gives —U at every point of 
the patch.) This leads to the general definition that a surface is orientable if it has a smooth 
vector field of unit normals defined everywhere on it. Except for isolated examples such as 
Henneberg’s surface, which has a Mobius strip contained in it, the surfaces we deal with will be 
orientable. 
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Now let’s return to our discussion of how the change in unit normal detects geometry. In order 
to see how we might calculate this change in a given direction, first let us see how a tangent 
vector acts to change a function. Let g(x, y, z) be a function and let a(t) = (a! (1), a7(t), @7(t)) 
be a curve. Then the chain rule of multi-variable calculus gives 


d dgdx dgdy dgdz 

—(g(a(t)) = = + SS 4 

dt dx dt dydt dzdt 
_ ag da! dgda? dgda? 
~ ax dt ay dt az dt 
_ (9g ag ag da’ da* da? 
~ \ax’ ay’ az dt’ dt’ dt 
= Va(a(t)) - a(t) 

where Vg al (2, se sR). Now let v € T,(M) and suppose g is restricted to M C R?. We might 


ask how g changes on M in the v direction. But the v-direction on M itself means in the direction 
of a curve a with a’ = v at p. The directional derivative of g in the v-direction is 


ef a 
vieKp) = 5 (ala) lr=0= Val) ¥. 


where a(0) = p € M and a’(0) =v. (Note that the last equality shows that the directional 
derivative does not depend on the curve chosen through p with velocity vector v.) Hence, v 
acts on a function g: M — R to produce a scalar v[g]. This is why the notation v[g] is chosen 
to denote directional derivative. Notice that the usual directional derivative of multi-variable 
calculus has precisely the same definition, but the given function g may not be restricted to a 
surface M and the direction vector v may be any vector of R?. In many cases v may denote a 
vector field on M — a smoothly varying choice of tangent vector at some collection of points 
p © M — and then, v[g] gives a new function on M defined by v[g](p) = v,[g]. (Here we 
say that a vector field v smoothly varies on M if, for any curve a: J — M, the assignment 
t b+ v(a(t)) is differentiable.) 

If f isa function on M, then the composition with a patch x may be formed, f(x(u, v)) = f ox, 
so that on the piece of M given by the patch, f is a function of u and v. Hence we may 
write 


of | a(f ox) 
du-—sé 


and similarly for v. Then, since 4 (x(u, vg)) = X,, we have 
d a 
xu] = (FCW, ¥0)) leetn = — lent 
du au 


and this works for all f. That is, a u-parameter velocity vector applied to a function gives the 
u-partial derivative of the function (composed with the patch). 


Exercises 2.2.5. Show the Leibniz Rule (or Product Rule) holds. That is, for v € T,(M), we 
have orientable. v[ fg] = vifle(p) + f(p)vIgl. 
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Exercises 2.2.6. Show v[x] = v!, v[y] = v’, v[z] = v? where v = (v!, v?, v°). 


Exercise 2.2.7. Using the discussion above, Exercise 2.2.6 and Exercise 2.2.5, show that, for a 
function f: M — R (with M parametrized by x(u, v) = (x; (u, v), x2(u, v), x3(u, v))), 


and 


af ax; Ox; of 0° x; 
xy [xu Lf] _ we OX; Ox; Ou av + Ox; duav- 


Now reverse the roles of u and v. What do you notice? 


Exercise 2.2.8. Suppose M is a surface which is the level set of a function g, g(x, y,z) =c. 
Show that Vg(p) is a normal vector for M at every p € M. Hint: take v € T,(M) and show 
Vg(p)- Vv = 0 using the definition of v and the chain rule. Hence, for level sets we have another 
way of obtaining a normal. Try this for the sphere g(x, y,z) = x7 + y? +27 = R’. 


Important Note 


In calculus, the directional derivative is generally defined using a unit vector v. The reason for 
this is simply that we wish to detect the change in a function in the v-direction and we do not 
want the magnitude of v artificially affecting our computation. For this very reason we will now 
take unit normals to surfaces. Let 
_ Xu X Xp 
[Xu X Xy| 

Now when we ask how U changes in a given direction, only U’s change in direction will 
be measured — and it is this quantity which will detect the shape of the surface. A normal 
vector U to a surface M takes any point of M, p, and assigns a vector U(p) € R? to it. Now, 
U(p) = (u'(p), u?(p), u3(p)) since it is a vector in R?, so we may write 


U =(u!,u?, u’) 
or, in the basis e; = (1, 0, 0), e2 = (0, 1, 0), e3 = (0, 0, 1) for R3, 


U= ule +u7e, + ure; 


3 
= ) ule; 
i=] 


where u!, u?, u> are functions from M to R. Because U assigns vectors to each p € M, we say 
that U is a vector field on M. Similarly, as we mentioned above, any assignment of a vector in 
R? to each point of M which varies smoothly over M is also called a vector field on M. 

How should we describe the change in U in a direction v € 7,(M)? We can simply look at 
the changes in the functions u', u?, uw in the v-direction. That is, we consider the directional 
derivatives of the u' in the direction v. Of course, we are only interested in the initial rate of change 
of U in the v-direction because v is situated at p, so we evaluate derivatives at 0 (for a(0) = p). 
We give the name covariant derivative to this initial rate of change of U in the v-direction. We 
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have glossed over one point here. Namely, in order to take derivatives in R?, we need U to be 
defined in an open set, not simply on M. We can satisfy this condition by extending U to be 
defined in a fattened M. It really doesn’t matter how the extension is defined in the open set as 
long as it agrees with U on M. For instance, we could simply move away from M a little bit in the 
normal direction and take the extension at that point to be the same vector U at the corresponding 
point of M. That the particular extension does not matter is shown in Exercise 8.3.1 (also see the 
surrounding discussion where the covariant derivative is discussed in a more general way). 

Although the notation is somewhat strange, remember that the covariant derivative is just the 
usual directional derivative applied to each coordinate function of U. This is denoted by 


VyU © (v[u'], viv), vu?) 


3 
= Yo v[u'le:. 
ix! 


In fact, what we have defined is really the covariant derivative in R?. In order to obtain a 
covariant derivative on M at a point, it is necessary to project the R?-covariant derivative onto 
the tangent plane of M at the point (see Section 2.2). Lemma 2.2.10 says that this final step is 
unnecessary in the case of the unit normal vector field U. 


Exercise 2.2.9. Suppose Z = >, z* (x1, X2, X3) €, is a vector field on a surface M: x(u,v) = 
(x1(u, v), X2(u, v), x3(u, V)). First show that 


azk ax; 
V,Z= aya 
= ¥(3 ie) 
and 


az* Ox; 
V,Z= pales 
7 > 7 Oxi =) “ 


Then show that second covariant derivatives commute. That is, use the chain rule (and Exer- 
cise 2.2.7) to get 


a2zk Ox; ax; azk 97x; 
Vx, Vx, Z = oe Ox, : 
= d {5 *. Oxj9x; du dv s ax; oud | [ 


= V,,Vx,Z. 


Now let’s return to the case of the unit normal vector field U. From the definition of v[-], we 
see that 


3 
d. 
WU =D) sua ino & 
i=I 


where a(0) = p and a@’(0) = v. Notice that, if U(t) is defined along a curve a(t), then the chain 


Tule applied to each coordinate allows us to write U, = it = VaryU. From our discussion above, 


we know that V,U tells us how M curves in the y-direction. That is, Vi.)U tells us about shape. 
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We therefore define 
S,(v) = —V,U 


to be the Shape Operator (or Weingarten Map) of M at p. The negative sign in the definition 
of S, is simply a convention which will allow us to give “positive” and “negative” curvature 
appropriate meanings. Also, note that choosing —U as normal reverses the sign of S as well. S$ 
is called an operator because of the following 


Lemma 2.2.10. 5, is a linear transformation from T,(M) to itself. 


Proof. Recall that T: V > V isa linear transformation of the vector space V to itself if it has 
two properties: (1) T(v; + v2) = T(v|) + T(v2) and (2) T(c - v) =c- T (vy) for any constant c. 
To prove the lemma, the first thing we have to show is that S,(v) is another vector in T,(M). 
To show this, it suffices to prove that S,(v) is perpendicular to U (since being perpendicular to 
U at p defines T,(M)). Now U-U = (u')? +(u?)? + (uw?) = 1 since U is a unit vector field 
and the usual Leibniz rule gives v[U -U] = 2V,U - U (show this!). Thus, for any v, 0 = v[1] = 
v[U -U] =2V,U - U. Hence V,U -U = 0 and V,U is perpendicular to U. 

To show S, linear, we must show that (1) S,(av) = aS,(v) for a € R and (2) S,(v + w) = 
S,(v) + S,(w). Property (1) follows immediately from the definition of V,U. Property (2) follows 
once we know that, for any function f, (v + w)[f] = vLf] + wL/]. By the following exercise, 
we are done. 0 


Exercises 2.2.11. Let a(t) = x(u(t), v(t)), B(t) = x(u(t), o(t)) with a(0) = p = A(O) and 
a’(0) =v, B’(0)=w. Show that y’(0)=v+w for y(t) = x((u(2r) + a(2r))/2, (v(2t) + 
0(2r))/2). Then, using the gradient formula, show that (v + w)[f] = v[f] + wL/]. 


Because S,: T,(M) — T,(M) is a linear transformation, we may bring to bear all the tools 
of linear algebra to analyze it. In particular, $, has a matrix form (which we discuss below) and 
standard invariants such as determinant, trace and eigenvalues have deep geometric significance. 
In this sense, the shape operator is the most fundamental of all the tools used to study the geometry 
of surfaces embedded in R°. 


Example 2.2.12 (M is a plane in R°). 

We know U = )-u'e; is constant. Because the u' are constant, v[u'] = 0 for all v. Hence, 
S,(v)=—-VU =—>> y[u‘]e; = 0 for all v. This makes sense since a plane is flat — that is, the 
shape operator detects no “shape”. 


Example 2.2.13 (57(R) is the sphere of the radius R). 
We shall use the standard parametrization of the R-sphere given by x(u, v) =(Rcosucosv, 
Rsinucosv, Rsinv) with tangent basis vectors 

x, = (—Rsinucosv, Rcosu cos v, 0), 

xX, =(—Rcosusinv, —Rsinusinv, Rcosv). 


Also, the unit normal is given by U = (cosucosv, sinucosv, sinv). To understand the shape 
operator S on S*(R) it suffices to know what S does to the basis {x,,, x,}. From our discussion 
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above we know that x,[f] = a = f, and similarly for v. Therefore, 


S(x,) = —Vx,U 
= —(x,[cosu cos v], x,[sinu cos v], x, [sin v]) 
= —(—Sinu cos v, cosu cos v, 0) 
Xy 
==> 
A similar calculation shows S(x,) = —%. Thus, the shape operator multiplies every tangent 
vector by —%. The corresponding matrix is — 4 7, where J is the 2 x 2 identity matrix. 


Exercise 2.2.14. Let M be the cylinder x? + y*? = R? parametrized by x(u, v) =(Rcosu, 
Rsinu, v). Show that the shape operator on M is described on a basis by S(x,) = —£x, and 
S(x,) = 0. Therefore, in the u-direction the cylinder resembles a sphere and in the v-direction a 
plane. Of course, intuitively, this is exactly right. Why? 


Exercise 2.2.15. Show that the shape operator S for the torus x(u, v) = ((R + rcosu)cos v, 
(R+prcosu)sinv, rsinu), on a basis, is given by S(x,) = —x,/r and S(x,) = (—cosu)/ 
(R+rcosu)Xy. 


Exercise 2.2.16. Show that the shape operator S for the saddle surface z = xy parametrized by 
X(u, v) = (u, v, uv) Is given on a basis by 


—uv 1+? 
S(x,) = ee + Shan RAY NT Uv 
(1+ u2 4+ v?)2 (1 +u? +02)? 
1+u2 uv 
S(x,y) = Xp. 


——_ x, SS 
(lh+u24v2ji "(1 4u2402)3 


We have already seen that a plane has zero shape operator. Intuitively, since the shape operator 
detects the change in the unit normal U, a zero shape operator for a surface M should imply 
that M is a plane. This is verified by the following result. Notice that the algebraic condition on 
the shape operator must be translated into a geometric restriction on M — a restriction which 
characterizes a plane. Recall that a plane is described by a point p and a normal vector U which 
we think of as originating from p. Then the plane is given by the collection of all points g with 
(q-p):U=0. 


Theorem 2.2.17. If S, = 0 at every p € M, then M is contained in a plane. 
Proof. Fix p € M with unit normal U(p) at p and take an arbitrary point g € M. We will show 
that q is in the plane determined by p and U(p). That is, we will show (gq — p)- U(p) = 0. Since 


q is arbitrary, then all of M will be in this plane. Take a curve a in M witha(0) = g anda(1) = p 
and define a function 


f(t) = (q — a(t)) - U(a(t)). 
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The product rule and the fact that 44 = VairyU allow us to compute the derivative of this 
P di (t) P 


function. 


f'(t) = —a"(t) - U(@(t)) + (9 — a(t) - Var U (a(t) 
=0 


since a(t) € Tai)M is perpendicular to U (a(t)) and Vai) (a(t)) = —Sayy(a’(t)) = 0 by hypoth- 
esis. Thus, since the derivative of f is zero, the function f(t) is constant. To see what constant 
this is, we can evaluate f at 0. But clearly ((0) = 0 since a(0) = q, so (gq — a(t))- U(a(t)) = 0 
for all t. In particular, for t = | we get (¢ — p)- U(p) = 0 and we are done. O 


2.3 The Linear Algebra of Surfaces 


Now, it should be clear that even for slightly complicated examples, an exact global representation 
for S on a surface M may be difficult to achieve. Even if we do obtain an exact representation of 
the shape operator (on a basis say), the form of the representation may not give much information 
(e.g., the saddle above). We can still cull the essential geometry of M, however, from computable 
quantities associated to the shape operator. Therefore, the usefulness of the shape operator itself is 
derived not from its computability, but rather from its theoretical description of geometry in terms 
of linear algebra. With this in mind, we shall prove one theoretical fact about the shape operator. 
The computations in the proof will be of importance later. Before we do this, it is worthwhile to 
recall a few notions of linear algebra. 

Suppose T: V — V isa linear transformation of the vector space V to itself. If we are given 
a basis B = {x,,...,X,} for V, then T may be represented by a matrix A. This correspondence 
works in the following way. First, T(x;) = )°"_, a/x, since B is a basis. Second, for fixed i, the 
a/' may be assembled into the i” column of a matrix A. Doing this for each i gives A = (a/"), 
an n x n matrix which takes e; = (0,..., 0, 1/,0,...,0) (thought of as a column vector and 
representing x;) to a= (a"',a”,..., a"). Here the vectors x; and T(x;) have been identified 
with their respective matrices of coefficients e; and a with respect to the basis B. Notice that, 
while the linear transformation T is defined without regard to a chosen basis, its representation as 
a matrix A depends essentially on which basis is chosen. Choosing a different basis for V changes 
the coefficients in the expansion of T(x;) and so a new matrix arises. For a linear transformation 
T, if there is some nonzero vector v and real number A such that 


T(v) = Av, 


then A is called an eigenvalue of T associated to the eigenvector v. If V is n-dimensional and 
there are n linearly independent eigenvectors v,,--- , V, with associated eigenvalues A|,--- , An, 
then, in this basis for V, the matrix for T is diagonal (and hence very simple) 


Ay 0 
A2 
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Therefore, det(7) = []}_, A; and tr(T) = }°y_, Ai. In general, the determinant of a matrix is the 
product of its eigenvalues and the trace is the sum of the eigenvalues. 


Example 2.3.1. Let 7: R? + R? bea linear transformation with T(1, 0) = (4, 3) and 7(0, 1) = 
(—2, —1). The matrix for T with respect to the basis {(1, 0), (0, 1)} is 


eal 


Suppose we have 


Subtracting, we get 


4-2 —2 aj _ |90 
3. -1-A}Lb{~ [o]’ 
But a matrix can only take nonzero vectors to zero if det(T) = 0. Hence 


4-iX —2 
aet| 3 Perla 


so (4—A)(—1 —A) + 6 = Dora? — 34 +2 =0 = (A — 1) — 2). Thus A = 1 or A = 2. These 
are the eigenvalues of 7. To find the eigenvectors, plug 4 back into the matrix equation to get, 


3 -—2])I/a 0 3 
ForA = 1: E SI6l= (ol or 3a—2b=0 or gia 


This means that any vector of the form [a, 34 = v will satisfy T(v) = v (since 4 = 1). Indeed, 


take [2, 3] and note that 
sie” A he ee 
3 -1]/3) [3° 


Exercise 2.3.2. Show that the other eigenvector is given by [a, a]. 


Exercise 2.3.3. Show that, if [2, 3] and [1, 1] are chosen as a basis for R?, then the matrix for T 
relative to this basis is 

1 0 

0 24 


Finally, note that A;A2 = 2,A, + A2 = 3 and, calculating from the original matrix, these are equal 


to 
4 -2 4 2 
aet (3 ri) ato =2 «(5 Ti) =4-1=3. 


A linear transformation T: R* — R? is said to be symmetric if T(v)- w = v- T(w) for all 
vectors v and w in R*. (When more general scalar products are used, this relation is sometimes 
written (T(v), w) = (v, T(w)).) 
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Exercisee 2.3.4. 

1. Show that, with respect to any orthonormal basis, if the 2 x 2 matrix [2 A represents a 
symmetric linear transformation, then b = c. Such a matrix is called a symmetric matrix. Recall 
that a basis is orthonormal if it consists of unit vectors which are perpendicular. 

2. Show that any 2 x 2 symmetric matrix has real eigenvalues. 


Theorem 2.3.5. The shape operator is a symmetric linear transformation. Further, S(Xu) - Xu = 
Xuu ( U, S(Xy) + Xy = Xuy - U and S(x,y) -Xy = Xyy- Uz 


Proof. Since {x,, Xy} is a basis for 7,(M) we need only show the equation on x, and x,. Note 


first that U -x, = 0 =U -x, since U is anormal to 7,(M). Now U - x, and U - x, are functions, 
so a vector may be applied to them. 


0 = x,[0] =x,[U -x,] 
; ox! 
[Eee 
= -Y (m5 = pin E |) by the Leibniz Rule 
= (stele xytu’ — ) 


= Vx,U -xy + U - Xuy. 


Hence, S,(x,) +X, = —Vx,U -x, = U -x,,. A similar computation shows 
Sp(Xy) - Xy = ~Vx,U +x, = U + Xyy. 


But x,, = X,, So the result follows. Finally, the same calculation as above shows S(x,)-X, = 
Xun ° U and S(x,y) -Xy = Xyy - U. O 


Corollary 2.3.6. The shape operator has real eigenvalues. 


We have seen in Lemma 2.2.10 that, for a surface M: x(u, v), the shape operator S takes 
tangent vectors to M to other tangent vectors. In particular, S(x,,) and S(x,) are tangent vectors. 
Because {x,, X,} is a basis for the tangent plane (at any point of 4), we then know that we can 
always write S(x,) = ax, + 5x, and S(x,y) = cx, +d x,y. If we now make the definitions 


E =X, Xu, FHXy-Xy, G=Xy-Xy, 
1 = S(x,y) + Xu, m = S(Xy)-Xy = S(Xy)- Xu, n = S(Xy)- Xo, 
then we can obtain the equations 


l=aE+bF,m=cE+dF,m=aF+bG,n=cF+dG. 
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These can be solved for a, b, c and d (see Section 2.5 or solve by hand). 


Fm—-Gl —FIl+Em En-—Fm —mG+Fn 
a= ~-—_.,, b = ——_____,, d = ———_-_, c = ——__—_—.. (23.1) 
EG — F? EG-— F? EG — F? EG— F? 
We will use these calculations to compute the shape operator in Section 2.5. 
There is yet another way linear algebra makes its presence felt in differential geometry. Given 
a map of surfaces F: M — N, we may define its derivative to be a certain linear transformation 
on all tangent planes F,,: T)(M) — Tr,p)(N). This linear transformation (which we shorten to) 
F, is defined by letting a tangent vector v be represented by a curve a: | > M with a(0) = p 
and a@’(0) = v and saying 


ef a 
F,(a'(0)) = 5 (Flatt) lo - 


Now, A(t) = F(a@(t)) is simply a curve on N and the right-hand side of the definition of F, is 
simply the velocity vector of B at F(p). The geometry of the definition may now be seen. In 
order to understand F, linearly approximate M near p by the tangent plane 7,(M). Each vector 
v in 7,(M) has a curve whose velocity vector is v, so use composition with F to map the curves 
over to N. The curves and their images under F fill out regions near p and F(p) respectively. To 
see the geometry of even this local mapping, however, we again must linearly approximate by 
taking derivatives of the image curves in N. Then, by definition, the velocity vector of the image 
curve is the image of the velocity vector of the original curve in M. 

We have made a big deal of this notion of the derivative of a surface mapping not only because 
it is a difficult idea, but also because it plays an important role in what is to follow. In particular, 
as we shall see later, surface mappings and their derivatives will give us the right tools to compare 
surfaces and their geometries. Of course, a linear transformation is determined by its effect on a 
basis, so, given a parametrization M: x(u, v), F, may be determined completely by calculating 
F,(x,) and F,(x,). 


Example 2.3.7. 

Let F: R? — R?* be a map of the plane to itself. In coordinates, we may write F(u,v) = 
({(u, v), g(u, v)). To calculate F,(x,,) we take the composition of the u-parameter curve (u, v9) 
with F, F(u, vo) = (f(u, vo), g(u, vo)) and differentiate with respect to u to get (fy, gu). A 
similar calculation for the v-parameter curve gives 


F (Xu) = (fas &u) and F,(Xy) = (fv, 8v)- 


Recalling the correspondence between linear transformations and matrices, we see that, with 
respect to the basis {x,,, x,}, the matrix for F, is 


1ny=|f fal 


Su Bu 


This is the Jacobian matrix of several-variable calculus. 


The derivative mapping of a surface map is, therefore, just a natural generalization to a map 
of surfaces of the Jacobian of a map of R” to itself. Now recall that, if f: R — Rhas f’(a) # 0, 
then, by continuity, f’(x) #0 for x € (a—€,a+e) for some € > 0. Then either f’(x) > 0 
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or f'(x) < 0 for all these x’s. Hence, f is strictly increasing or decreasing on this interval 
and, therefore, has an inverse function on the interval. A similar thing happens for a mapping 
F: M — N. This ts called the Inverse Function Theorem. 


Theorem 2.3.8 (Inverse Function Theorem). /f a map of surfaces F: M > N has deriva- 
tive transformation F,,: T)>(M) > Trip)(N) with det(F,,) #4 0, then there exists a small open 
neighborhood O of F(p) € N on which a smooth inverse function F~' is defined. 


The Inverse Function Theorem is saying that the linear algebra of F at a point determines whether 
M and N are the same near that point. 

Now, the shape operator is a linear transformation of the tangent plane, so we might well ask 
if there is a surface mapping whose derivative is +5? The answer is, in fact, yes. The mapping 
whose derivative is +S is called the Gauss map and its behavior provides a geometric alternative 
to the linear-algebraic approach of the shape operator. The Gauss map is a mapping G: M > S? 
from the surface M to the unit sphere S? given by G(p) = U(p), where U(p) is the unit normal 
of M at p. Since U(p) is a unit vector in R?, we may represent it as a point on S°, so this definition 
makes sense. Now, the Gauss map has an induced derivative map G,: T,M — Tg; ase which, 
by definition, is given by 


d 
G.(v) = F (Ga Ir=0 


= FU lat hao 
= VarnU 
=VU 

= -S(v), 


where a(0) = p and «’(0) = v. One thing needs to be made clear. Namely, the shape operator 
has range T,(M) while the derivative of the Gauss map has range Tgp) S 2, so how can G, = —S? 
We must look at the geometry of the situation for the answer. The tangent plane, no matter 
where we visualize it meeting the surface, is a vector space and, therefore passes through the 
origin. The plane 7,(M) is the plane through the origin perpendicular to U(p). For any point 
q € S*, the tangent plane 7,(S*) has the beautiful property that it is perpendicular to q itself, 
thought of as a vector in R*. Because G(p) = U(p) by definition, the plane Te(p(S *) is the plane 
through the origin perpendicular to U(p). That is, T,(M) = To(p)(S?). This explains how the 
computation above can make sense. We will see later that the Gauss map has many uses. For now, 


try 


Exercise 2.3.9. For the cone x(u, v) = (vcosu, v sinu, v), compute the Gauss map and its deriva- 
tive. Estimate the amount of area the image of the Gauss map takes up on the sphere. 


Exercise 2.3.10. Compute the Gauss map and its derivative for the cylinder x(u, v) = (Rcosu, 
Rsinu, v) and estimate the amount of area the image of the Gauss map takes up on the 
sphere. 
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Exercise 2.3.11. Compute the Gauss map and its derivative for the catenoid x(u, v) = 
(u, coshu cos v, cosh u sin v). Show that the Gauss map is a one-to-one map from the catenoid to 
the sphere. Hint: focus on the first coordinate first. 


Exercise 2.3.12. Compute the Gauss map for Enneper’s surface x(u, v) = (u — u3/3 + uv’, 
v —v?/3 + vu?, u? — v?) and show that it is a one-to-one map from Enneper’s surface to the 
sphere. Also show that the image of the disk {(u, v)|u? + v? < 3} under G covers more than 
a hemisphere of the sphere. Hint: first, write U in polar coordinates by letting u = r cos@ and 
v =rsiné and, then focus on the third coordinate. 


2.4 Normal Curvature 


We would now like to use the shape operator to obtain a notion of curvature of a surface. After a 
very geometric approach to this normal curvature we will prove a theorem linking the geometry 
to the linear algebra of S. First we note, 

Lemma 2.4.1. Ifa is a curve in M, thena” -U = S(a’)- a’. 

Proof. We will give three proofs of this result. The first reinforces the computations above, the 
second foreshadows the types of calculations we will do when we consider geodesics and the 
third uses the standard notation we shall use from now on. 


(1) We know a’ - U = 0 since U is normal to 7,(M). Then, 
0=a'[a’-U] 
=a’ bp oa 
i 
= Soa’ yu! + So aa'[u'] by the Leibniz Rule 
i i 


da 


dt @) 


da £ . : %, 
= X a3 ub+ Lower ‘a’ since a! (a(t)) = 
=a” -U+VyU - a’. 
Hence, S(a’)- a’ = —VyU -a’ =a" -U. 
(2) By Lemma 2.1.3, write a(t) = x(u(t), v(t)) and a’ = u’x, + v’x, by the chain rule. Applying 
the chain rule again gives 
= UX, + 2U'U'Xyy FV? Xy tux, + vx). 


Now, x, -U =0Oandx,-U =0,so 


2 2 
a” U = (ul xy, + 2u'v'Xyy + UX) U- 
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Because S is a linear transformation, we have 


S(a’) - oe" = S(u'x, + v’xy) - (u’x, + v'xy) 
= (u'S(x,) + v’S(x,)) - (u’x, + v’x,) 
= u’*S(x,) -X, + u’v' S(X,) «Xp + u’V’ (xy) Xp + Vv S(Xy) © Xp 
=u?x,,-U tu'v'X,)-U $u'v'X,-U tv? x,y -U 
= (ul Xyy + Qu’ Xyy + v” Xyy) -U 
=a’-U. 
(3) We take the derivative of 0 = a’ - U to get 


0 =(a'-UY 
=a”-U+a'-U' where U’ = VyU = —S(a’) 
a”. U = —a' -(—S(a’)) 
a” -U =a’ - S(a’). 
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We interpret a” - U as the component of acceleration due to the bending of M. Of course, we 
assume that @ has unit speed so that the magnitude of a’ does not affect our measurement. With 
this in mind, we make the following definition. For a unit vector u € T,(M), the normal curvature 


of M in the u-direction is 
k(u) = S,(u)- u. 
Let a be a unit speed curve with a(0) = p, a’(0) = u. Then 
k(u) = S,(u)-u 
= Sp(a'(0)) - (0) 
= a"(0)-U(p) 
= k(0)N(0)- U(p) 
= x(0)cosé 


where N is the Frenet normal to the curve @ and x is @’s curvature. The angle @ is the angle 


between N(0) and U(p) (see Figure 2.16). 


Figure 2.16. Normal curvature 


2.4. Normal Curvature 93 


Exercise 2.4.2. The total torsion of a unit speed curve a: [a, b] > R? is J > ¢ dt. Show that a 
closed curve on an R-sphere has zero total torsion. Hints: use the formula above, k(T) = ka cos 6, 
where T =a’ for a unit speed curve a, the shape operator of the sphere and differentiate 
cosd=N-U. 


While the normal curvature is a multiple of the curvature of a, we might ask if there is a 
curve o whose curvature is exactly equal to the normal curvature k(u). The answer turns out to 
be yes. 


Proposition 2.4.3. Let P denote the plane determined by U(p) and u (at p € M) and let o 
denote the unit speed curve formed by P 1 M with o(0) = p. Then k(u) = +k, (0). 


Proof. First we show that o’(0) = u. But this follows since u and o’(0) are unit tangent vectors 
and u, o’(0) and U(p) lie in a single plane. Namely, the only way for u and o’(0) to both 
be perpendicular to U(p) and in P is for o’(0) = tu. Take a parametrization with o’(0) = u. 
Now oa’s normal N,(0) is perpendicular to 7,(0) = o'(0) = u and N,(0) is in the plane P 
since o ia a plane curve. Thus, N,(0) = +U(p). Then cos@ = +1 since 6 = 0 or xz. Hence 
k(u) = +k, (0). O 


What does normal curvature tell us? If k(u) > 0, then the normal N,(0) is equal to U(p) (i.e., 
6 = 0°) and o bends up toward U(p). Hence, so does M along o. If k(u) < 0, then the normal 
N,(0) is equal to —U(p), so, along o, M bends away from U(p). If k(u) = 0, then «,(0) = 0. 
This does not mean that o is a line, of course, since we don’t know that «,(t) = 0 for all t. But it 
does say that near p the rate of bending is small. 

Therefore, the sign of normal curvature tells us about the bending of M toward or away from 
its normal in a given direction. Note that by changing the normal to —U(p) the signs on k(u) 
reverse. Therefore, to avoid this ambiguity, we must fix a convention for normals. For instance, 
if a surface M completely encloses a volume, then we could say that we will always take the 
outward-pointing normal; that is, the normal which points away from the enclosed volume. If the 
surface is given as a graph of a function of two variables x and y, then we can always take the 
normal with positive z-coordinate. 

Normal curvature is a function from unit vectors in a plane — that is, a circle of radius 1 — to 
real numbers. In fact, it can be shown that this normal curvature function is continuous. Just as 
any continuous function on a closed interval attains its maximum and minimum, so too does any 
continuous function on a compact (i.e., closed and bounded) set such as the circle. Hence, there 
are unit vectors u; and u2 such that 


k(a,) =k; = max k(u), k(u2) =k, = min k(u). 


The unit vectors u,; and u2 are called principal vectors and k, and kz are principal 
curvatures. 


Exercises 2.4.4. A curve a: | > M is a line of curvature if o'(t) is an eigenvector of the 
shape operator S for every t € J. This is equivalent to saying that the unit tangent 7, is 
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always either u; or u2. Show that @ is a line of curvature if and only if a’ is parallel 
to U'=V,-U along a. Further, suppose @ is a plane curve. That is a = MMP for some 
plane P. Then show that a@ is a line of curvature if the angle between M and P is constant 
along a. 


Let’s do an example before we go on just to see how normal curvature may be computed. We 
will make use of the Frenet Formulas. 


Example 2.4.5 (Saddle Surface Normal Curvature). 
Let M bea saddle surface z = x? ~ y*. We use the normal U given by the gradient of g(x, y, z) = 
z—-x*+y=0, 
U= Vg (—2x, 2y, 1) 
Vel /1 4 4x2 + 4y? 

Let p = (0, 0, 0). Then U(p) = (0, 0, 1). Take u = (1, 0, 0) for example. The curve o determined 
by the intersection of the plane spanned by (0, 0, 1) and (1, 0, 0) with M is the parabola z = x? 
bending toward the normal. Therefore, we should find k(u) > 0. Let’s see how to compute 
explicitly. 

The plane P through u and U(p) is the xz-plane with normal vector u x U(p) = (0, —1, 0). 
Hence, the equation which describes P is y = 0 and o is the parabola z = x”, which may be 
parametrized by o(t) = (t, 0, t7) say. Then o’(t) = (1, 0, 2f) and a(t) = (0, 0, 2). Then 


|o’(0) x o"(0) 


k(1, 0, 0) = +x,(0) = 
( ) = +k, (0) oF 


=2/1=2>0 


just as we thought. Note. We could find the entire Frenet Frame: 


o’ 1 2t 
PS 2 (4 
lo’| (a Fa) 
with 7(0) = (1, 0,0); B= S255 = (0, -1, 0); N = B x T = (0,0, 1) = U witha + sign. This 


o’xo"| 


justifies k(1, 0, 0) = +«,(0). 


Exercise 2.4.6. Find k(u) at p = (0, 0,0) where u = (0, 1, 0). What sign should your answer 
have? 


Exercise 2.4.7. Find k(u) at p = (0, 0, 0) where u = (+: 4, 0). 

Example 2.4.8 (Cylinder Normal Curvature). 

Let M: x?+y?=1 be a cylinder with p =(1,0,0) and U(p) =(1, 0,0). A unit vector 
u€7,(M) has the form u = (0, u!, u?) with (u')? + (v2) = 1. A normal for the plane de- 
termined by u and U(p) is (0, —u’, u!), so the plane’s equation is z = (u?/u')y. The inter- 
section of the plane with M is the set {(,/1 — y?, y, (u/u')y)} for any y. Parametrize o by 
a(t) =(V1 — 8, t, (u2/u')t) with 


ot) = (-t/V1=?, 1,u/u') and o"(t) = (-1/(1 —?)*, 0,0). 
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Figure 2.17. max; k(u) = 0; 
u' = 0; u = (0, 0, 1) indirection 
of rulings 
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Figure 2.18 min; k(u) = —1; 
u! = 1;u = (0, 1, 0) in direction 
of velocity vector of directrix 
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Then we have, 


T(0) = (0, 1, u?/u!)/V1 + (u2/u!, 


B(O) = 0'(0) x 0 (0)/|o'(0) x o(0)| 
= (0, —u?/u',1)//1 + (u2/u'? 


N(0) = ((-W?/ulY — 1/0. + (w?/u!)), 0, 0) 
= (—1,0, 0). 


Hence, N(0) = —U(p), so we need a — sign in k(u) = —x,(0). Further, 


lo'(0) x o"(0)] (U2 /ulY + 1)? 


eer ee! pe ee Swe 1\2 
|o’(0)/3 ~ d+ (u2/u!)2)3/2 = (uy. 


Kg(0) = 
Since (u!)? + (u2)? = 1. Hence, k(u) = —(u')’. This is negative or zero. Now, since u = 
(0, u!, u*) is on the unit circle in the yz-plane, max k(u) = 0 occurs when u! = 0 and mink(u) = 
—1 occurs when u! = 1. The corresponding geometry is clear. The cylinder M is flat in the ruling 
directions and bends away from the normal in directrix directions. Indeed, the bending is what 
we might call circular. See Figure 2.17 and Figure 2.18. 


Exercise 2.4.9. Work through the example above. 


Although it was defined in terms of the shape operator originally, we have seen that normal 
curvature may be described completely geometrically as the curvature of the curve of intersection 
of the surface with a particular plane. We will now present a fundamental theorem linking the 
geometry of normal curvature to the linear algebra of the shape operator. Say that a point p « M 
is an umbilic point if the principal curvatures at p are equal (i.e., k;(p) = k2(p)). Note that this 
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implies that the normal curvature at p is constant. For instance, every point on a sphere is an 
umbilic point. 


Theorem 2.4.10. 
1. If p € M is umbilic, then S,(u) = ku where k = ky = kp. 
2. If p € M is not umbilic, then there are exactly two perpendicular unit eigenvectors of Sp 
with associated eigenvalues the principal curvatures at p. 


Proof. We will prove the theorem by going in reverse. We shall consider the eigenvalues of the 
shape operator and show that these are, in fact, k; and k2. Let u, be a unit eigenvector for S with 
eigenvalue A; S(u;) = A; u;. Let up be the unit tangent vector rotated 90° counterclockwise from 
u;; that is, up = U x u,. We may write S(u2) = au; + buy. Buta = S(uo)- u, = S(u;)- u2 = 
A, Uy + Uy = 0, so S(u2) = Az U2, where b = Az. Thus, up is the other eigenvector of S and is 
perpendicular to u, by construction. Also, k(u;) = S(u;)-u; = A; uj; -u; = A;, so Ay and A2 are 
normal curvatures at p as well. 

Note that, ifA; = Az =A, then S(u) = A u forall uand p is umbilic. So, suppose without loss of 
generality that A. < A, and take a unit vector u which may be written as u = cos 6 u; + sin up, 
where 6 is the angle between u and u;. We may now compute the normal curvature at u, denoting 
the dependence of u on 6 by writing (6). 


k(6) = S(u)-u 
= S(cos 6 uy + sin uz) - (cos6 u; + siné uz) 
= (cos6 S(u;) + sin@ S(uz)) - (cos @ uy + sin@ uz) 
= cos? S(u;)-u, + sin@ cos@ S(u;) - uy sind cos@ S(uy)- u; + sin? 6 S(uz) - up 


= cos? 6A; + sin? 6 Ad. 


Now write k(@) = A; + (Az — Ay) sin® 6 using sin? 6 + cos? @ = 1. Since A2 < Ay, k() is a max- 
imum when 6 = 0; that is, when u = uy. Hence, ky = kmax(9) = k(uy) = Ay. 

Similarly, we may write k(@) = (A; — Az) cos? @ + Az and, since Az < A1, k(@) is a minimum 
when 6 = 5. That is, k(@) is a minimum when u = up. Hence, kz = kmin(@) = k(u2) = Az. L 
Corollary 2.4.11. For u = cos@ u, + sin @ up as above, the normal curvature is given by Euler’ 
Jormula 


k(u) = cos? 6 ky + sin? @ ky. 


2.5 Surfaces and Maple 


In this section we will see how to plot surfaces and calculate shape operators and normal curvatures 
from parametrizations using Maple. We start with 


> with(LinearAlgebra):with(plots): 
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Figure 2.19. Enneper’s surface 


Let's look at Enneper’s surface first. We define a parametrization in Maple as a vector of 
functions of two variables as follows. 


> Ennep:=<u-u73/3+u*y"2|v-v~3/3+v+u"2|u-2-v~2>; 


I | 2 2 
Ennep := a = w+urv,v— 3 veto, Vv 


We can now plot Enneper’s surface (see Figure 2.19). There are various options for the Maple 
plot3d command and some are listed in the command. For example, “shading=zhue” gives a 
certain color scheme to the plot, Other choices for shading are “XY”, “XYZ” and “Z” for 
example. The option “lightmodel =light1” shines on the plotted surface from a certain direction. 
The “orientation” option simply chooses a viewing position for the observer of the plot. Finally. 
“scaling=constrained” makes the plot look exactly as it should without the usual computer scaling 
resulting from monitors being wider than they are tall. 


> plot3d(Ennep,u=-2..2,v=-2..2,scaling= constrained, shading 
=zhue, lightmodel=light1,orientation=[89,54]); 


Exercise 2.5.1. Using the parametrizations given in Exercise 2.1.22 and Exercise 2.1.23, plot the 
hyperboloid of one sheet. Experiment with different bounds on the parameters as well as plotting 
options. 


We can write procedures which calculate the basic quantities associated to surfaces: the 
coefficients E, F and G; the unit normal; and the coefficients /, m and n- 


> EFG := proc(X) 
local Xu,Xv,E,F,G; 
Xu := <diff(X[1] ,u) ,diff(X[2] .u) ,diff(X[3] ,u)>; 
Xv := <diff(X[1],v),diff(X[2],v),diff(X[3],v)>; 


E := DotProduct (Ku, Xu, conjugate=false) ; 
F := DotProduct (Xu, Xv, conjugate=false) ; 
G := DotProduct (Xv, Xv, conjugate=false) ; 
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si 
en 


3S. 
lo 
Xu 
Xv 
Z 
Ss: 


mplify((E,F,G]); 
d: 


UN := proc{X) 
cal Xu,Xv,Z,s; 

:= <diff(X[1] ,u) ,diff(X[2] ,u) ,diff(X[3],u)>; 

:= <diff(X(1] ,v) ,diff(X[2] ,v) ,diff(X[3],v)>; 
:= CrossProduct (Xu,Xv); 
=VectorNorm(Z,Euclidean,conjugate=false) ; 


simplify (<Z[1]/s|Z[2]/s|Z[3]/s>,sqrt ,trig,symbolic) ; 


en 


> 


d: 
Imn := proc(X) 


local Xu,Xv,Xuu,Xuv,Xvv,U,1,m,n; 

Xu := <diff(X[1],u) ,diff(X[2] ,u) ,diff (X[3] ,u)>; 

Xv := <diff(X[1],v) ,diff(X[2] ,v) ,diff(X[3],v)>; 

Xuu := <diff(Xu[1] ,u) ,diff(Xu[2] ,u) ,diff (Xu[3] ,u)>; 
Xuv := <diff(Xu[1i] ,v) ,diff(Xu[2] ,v) ,diff(Xu[3] ,v)>; 
Xvv := <diff(Xv[1],v) ,diff(Xv[2] ,v) ,diff (Xv [3] ,v)>; 
U := UN(X); 

1 := DotProduct(U, Xuu, conjugate=false) ; 

m := DotProduct(U,Xuv,conjugate=false) ; 

n := DotProduct(U,Xvv,conjugate=false) ; 
simplify([1,m,n] ,sqrt,trig,symbolic); 

end: 
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These basic ingredients allow us (see (2.3.1)) to write procedures for the shape operator and, 
hence, normal curvature. 


> 


lo 


Baaenrn.< 


[Ss 
en 


shape: =proc(X) 
cal Y,Z,a,b,c,d; 


:=EFG(X) ; 
:=1mn(X); 
:=simplify((Z[1] *Y [3] -Z [2] *y [2] )/(y¥ [1] *Y(3]-y[2]72)); 
:=simplify((Z([2]*Y[1]-Z[1] *Y [2])/(Y[1]*Y[3]-Y[2]~2)); 
:=simplify((Z[2] *Y[3]-Z[3] *y [2])/(Y(1] *Y[3]-Y[2]~2)); 
:=simplify((Z(3] *Y [1] -Z [2] *y [2] )/(y[1] «Y[3]-Y[2]72)); 


(x_u) =a*x_ut+b*x_v,S(x_v)=c#x_utd*x_v]; 
d: 


As a prelude to the things we will consider in Chapter 3, we also give the following procedure 
for the matrix of the shape operator with respect to the basis {x,,, xy}. 


> 
lo 
Y 


vA 
a 
b 
Ge 
d 
M 


shape_matrix:=proc(X) 
cal Y,Z,a,b,c,d; 


:=EFG(X) ; 
:=1lmn(X); 
:=simplify((Z[1] *Y [3] -Z [2] *¥ [2])/(y[1]*Y[3]-yY[2]72)); 
:=simplify((Z[2]*y([1]-Z[1] *y (2])/(y(1]*Y(3]-Y[2]72)); 


=simplify((Z[2] *Y [3] -Z[3] *¥ [2])/(Y[1] *Y[3]-¥[2]*2)); 


:=simplify((Z[3] *Y[1]-Z(2] *Y(2])/(yY([1] *Y[3]-Y[2]*2)); 
atrix(([[a,c], [b,d]]); 


end: 
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Now let’s compute a few examples. We will give the shape operator evaluated on the basis 
elements x, and x,, the corresponding matrix (with respect to that basis) and, sometimes, certain 
linear algebraic invariants associated to that matrix. First, let's verify that the shape operator on 
the R-sphere is simply multiplication by —1/R. 


> shape(<R*cos(u)*cos(v) |R*sin(u)*cos(v) |R*sin(v)>); 


[Six-x) = -—, S(Qv_vy) = -=] 


Now let's look at a torus of revolution and Enneper’s surface. 


> shape(<(R+r*cos(u))*cos(v), (R+r*cos(u))*sin(v), 
r*sin(u)>); 
_ Xu _  cos().x_¥ 
[si , ma R+ ore] 


> shape(Ennep) ; 


2xu 


SR a ene iS 
[sem 1+2u?+2v? + «4+ 2u? v2 + v4 


S(x.v) 2x_v 
. Vv =__ 
1+20?4+2v?+u4+2u2v?+04 


Exercise 2.5.2. Describe the similarity between the forms of the shape operators of the sphere 
and Enneper's surface. Hint: see Proposition 4.8.28 and Exercise 4.8.30. 


Figure 2.20. A cylinder 


Now consider a cylinder, Notice that S(x,) = 0 and observe that this means that U does not 
change along a ruling. Hence, a cylinder is developable, see Exercise 3.2.20. 


> cylinder:=<R*cos(u) |R*sin(u) | v>; 
cylinder :=[Rcos(u), Rsin(u), v] 
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> shape(cylinder) ; 
XU 
[Se = -, sev) = 0| 


> plot3d(subs(R=2,cylinder) ,u=0..2*Pi, v=0..3,shading=XY, 
lightmodel=light1,grid=[25,10], orientation=[-46,61]); 


So far, the matrices for the shape operator have been diagonal. This is not usually the case 
and the saddle surface provides an example of this. While, for diagonal matrices, we can spot the 
eigenvalues immediately as the diagonal entries and the determinant as the product of these, we 
must work harder to find such quantities for a general matrix. Maple makes this easy however. 


> shape(<u,v,u*v>) ; 


: - uUvxU (1+ v?)xv 
C4) = — Tye ed + 4 ut as vO’ 
S(.v) = (1 +u?)x_u uUXxX_V 


(1 +u2 4+ 02)@/) (1 +. 2? + v2)G/2) 


> Eigenvalues(shape_matrix(<u,v,u*v>)); 
~2JVl4+wtvuvt2V3u2 vt ute? tu? vt t+ 142024202? +444 v4 
2(1+2u24+2v2 +u44+2u? v? + v4) 
—2V14u24+vuv—-2V3W vty tu? vt +1422 4202 4+u4 + v4 
2(14+2u24+2v2+u44+2u? v2 + v4) 


> Determinant (shape_matrix(<u,v,u%*v>)); 
1 
404+ vy 


Now compare this result with Exercise 3.2.16 and Exercise 3.2.18. 


Exercise 2.5.3. Carry out the following Maple command and then compare with Enneper’s shape 
operator. What is similar? 


> shape(<cosh(u)*cos(v) |cosh(u)*sin(v) Ju>); 

Exercise 2.5.4. Carry out the following Maple commands and then compare to Exercise 3.3.5. 
> surfrev:=<g(u) |h(u)*cos(v) [h(u)*sin(v)>; 
> shape(surfrev) ; 


> shape_matrix(surfrev) ; 
> Determinant (shape_matrix(surfrev)); 
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We now use the definition k(u) = S(u)- u to write a procedure for normal curvature. We 
need to input a specific point (i.e., the uO and v0) and a specific angle (i.e., 0 < t < 27) at 
which to find normal curvature. Note that S.(cos(t), sin(t)) is Maple’s notation for applying 
the matrix S to the (unit) vector (cos(t), sin(t)). Since we wish to specify a unit vector by 
(cos(t), sin(t)), we require x, and x, to be orthogonal unit vectors. In fact, however, we can 
even use the procedure when [x,| = [x,| and x, -x, = 0 as long as we don’t mind magni- 
fying the normal curvature by |x,|?. This follows since, for a vector v, we have S(v)-v = 
IviPk(v/|v|). 


> normal_curvature:=proc(X,u0,v0,t) 

local §; 

S:=subs({u=u0,v=v0},shape_matrix(X)); 
DotProduct(S.<cos(t) ,sin(t)>,<cos(t),sin(t)>,conjugate= 
false); 

end: 

> normal_curvature(<(Rtr*cos(u))*cos(v), 
(R+r*cos(u))*sin(v) ,r*sin(u)>,0,0,t); 


cos(t)? —_ cos(0) sin(t)? 
r R+rcos(0) 


Note that we are on the outermost point of the torus and that the maximum and minimum 
normal curvatures occur at t = 0 and ¢ = 2/2. We can plot the normal curvature at this point as 
well (see Figure 2.21). Note where the maxima and minima are. 


> plot (normal_curvature(subs({R=5,r=1}, <(Rtr*cos(u))* 
cos(v), (Rtr*cos(u))*sin(v) ,r*sin(u)>) ,0,0,t) ,t=0..2*Pi); 


Exercise 2.5.5. Plot the normal curvature for the catenoid x(u, v) = (u, cosh(u) cos(v), 
cosh({z) sin(v)). 


0.4 


0.2 


Figure 2.21. Normal curvature for a torus atu = 0, v = 0 
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Figure 2.22. A generalized cylinder Figure 2.23 A saddle as a ruled surface 


Maple can be used to create parametrizations for ruled surfaces by inputting two curves B() 
and 5(u). Here is a procedure to do this and several examples (see Figure 2.22 and Figure 2.23), 


> rule_surf:=proc(beta,delta) 
simplify (betat+tv*delta,symbolic) ; 
end: 
> rule_surf(<cos(u)|sin(u) |0>,<0]1/1>); 

[cos(u), sin(u) + v, v] 
> plot3d(rule_surf(<cos(u)|sin(u)|0>, <0|1|]1>),u=0..2#Pi, 
v=-1..1,scaling=constrained,shading=XY, orientation= 
[-24,63]); 
> plot3d(rule_surf(<u]0/0>,<O/ilu>), u=-1..1,v=-1..1, 
scaling=constrained ,shading=XY, lightmodel=light2, 
orientation=[-74,71]); 


A particular type of ruled surface is a tangent developable. Given a curve B(u), its tangent 
developable is defined to be the surface parametrized by x(u, v) = B(u) + uB’(u). We may 
create and graph the tangent developable of the helix (for instance) B(u) = (cos xu, sinu, «). See 
Figure 2.24 for the result. Also see Exercise 3.2.22. 


> plot3d(rule_surf(<cos(u)|sin(u)|u>, map(diff,<cos(u) | 
sin(u) |u>,u)) ,u=0..3.5*Pi,v=-4. .4,scaling=constrained, shading 
=XY, lightmodel=light2, orientation=[-133,73] ,grid=[50,20]); 

> plot3d(rule_surf (<cos(u) |sin(u) |u>,map (diff ,<cos(u) | 

sin(u) |u>,u)) ,u=0..3.5*Pi,v=0..4,scaling=constrained, shading 
=KY, lightmodel=light2,orientation=[-133.73] ,grid=[50,10]); 

> shape(rule_surf(<cos(u)|sin(u)|u>, map(diff,<cos(u) | 


sin(u) |u>,u))); 
J/2 xu : V2xv 


2v 2v ° sr y=0] 


[S@_n) = — 
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Figure 2.24. Tangedt developable of a helix and half of it 


> shape_matrix(rule_surf(<cos(u)|sin(u)|u>, 
map (diff ,<cos(u) |sin(u) |u>,u))); 


> Determinant (shape_matrix(rule_surf( <cos(u)|sin(u)|u>, 


map(diff,<cos(u)|sin(u)|u>,u)))); 
0 


Exercise 2.5.6. Create a ruled surface with A(w) = (0,0, sin(ku)) and &(u) = (cos a, sinu, 0). 
Plot this ruled surface for varying k. What do you notice? See Figure 2.25 for the case k = I. 


> plot3d(rule_surf(<0|0|sin(u)>, <cos(uv)|sin(u) 10>) ,u=0.. 
2*Pi,v=-1..1,scaling=constrained, shading=XY,lightmodel 


=light2,orientation=[-27 ,54]); 
Here is an example of a tangent developable (see Figure 2.26. The other commands show the 


underlying curve (see Figure 2.27). 


plot3d(rule_surf (<cos(u)*cos(2*u)| sin(u)*cos(2*u) | 


sin(2*u)>,map(diff ,<cos(u)*cos(2*u) |sin(u) *cos(2*u) 
|sin(2*u)>,u)) ,u=0..3.5+*Pi,v=-2..2,scaling=constrained, 


shading=XY, lightmodel=light2, orientation=[38,43] , 
grid=[40,20]); 
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Figure 2.25. The ruled surface 


Figure 2.26 Tangent developable 
(vcos(m), v sin(it), sin{#)) 


of a spherical curve 


Figure 2,27, Curve on sphere 


cu:=tubeplot (convert (<cos(u)*cos(2*u) |sin(u)*cos(2*u) | 
sin(2*u)>,list), u=0..2*Pi,radius=0.015,color=black, 
numpoints=200,scaling=constrained) : 
> sphere1:=plot3d(<cos(u)*cos(v)| sin(u)*cos(v)|sin(v)>, 
u=0.,2*Pi,v=-Pi/2..Pi/2): 
> display({cu,sphere1},style=wireframe, scaling= 
constrained, shading=XYZ, lightmodel=light3, 
orientation=[38 ,43]) ; 


> 


display({cu,spherei},scaling=constrained, 
shading=XYZ, lightmodel=light3, orientation=[38,43]); 


Exercise 2.5.7. Write a Maple procedure to create a parametrization for a surface of revolution 


given a plane curve a(u) = (g(u), (u)). Use your procedure to plot surfaces of revolution such 
as the catenoid and the torus. 
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Exercise 2.5.8. Use the two procedurse above to create ruled and revolution surface parametriza- 
tions for the hyperboloid of one sheet x? + y* — z* = 1. 


Exercise 2.5.9. Start with a cylinder parametrized by x(u, v) = (a cos(uz), a sin(u), v) and twist 
the disk at height c by an angle ¢. In order to determine a parametrization for this twisted cylinder, 
construct the line from p = (a cos(x), a sin(u), —c) to gq = (a cos(u + @), a sin(u + @), c). 
The parametrization is a(v) = p + v(q — p) which has the form 
x(u, v) =(a cos(u) + v(a cos(u + ¢) 
—a cos(u)), a sin(u) + v(a sine + @) —a sin(u)), —c+20vc). 
This is a parametrization of the twisted cylinder. Below, use the given Maple commands 


to show that this parametrization is that of a hyperboloid of one-sheet (of revolution) with 
equation 


where we have! 


> with(plots):with(LinearAlgebra) : 
The parametrization of the twisted cylinder is 


> cyltwist:=<a*cos(u) ,a*sin(u) ,-c>+v*<ax*cos(utphi) 
-a*cos(u) ,a*sin(utphi)-a*sin(u) ,2*c>; 


At z=0, we have v = ;- We also have x + x = |, Therefore, x? + y? = A*. We can 
simplify this equation to get 


> simplify (expand(eval ((a*cos(u)+v*(a*cos (utphi)- 
axcos(u)))°2+(a*sin(u) +v* (a*sin(u+phi)-a*sin(u)))°2,v=1/2)), 
symbolic)=A“2; 

> A=simplify (sqrt (1/2*a72*(1+cos(phi))),symbolic) ; 


So we have the correct A. Now we can plug this, as well as v = | and x* + y? = a? into the 
hyperboloid equation and solve for C. 


> hypeq:=eval(a72/A°2-c72/C°2, A=axsqrt ((1+cos (phi) )/2))=1; 


> solve (hypeq,C); 
> C#simplify(solve(hypeq,C) [2] ,symbolic) ; 


'Thanks to John Reinmann for coming up with the correct A and C. 
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Figure 2.28. A twisted cylinder is a hyperboloid of one-sheet 


Now verify that the A and C found actually work to give the correct hyperboloid equation as 
above. 


> simplify(expand(cyltwist [1] ~2/(a72*(1+cos(phi)) /2)+ 
cyltwist [2] ~2/(a~2*(1+cos(phi))/2) 
-cyltwist [3] ~2/(c72*(sin(phi)*2)/(1-cos(phi))~2))); 


Therefore, the twisted cylinder is a rotation hyperboloid of one-sheet with A and B dependent 
on the angle of twist @. The following procedure plots the twisted cylinder given a radius of the 
circle at height c, an angle of twist @ and a height c. 


> cyltwist:=proc(a,phi,c) 

local Z; 

Z:=<a*cos(u) ,a*sin(u) ,-c>+v*<atexpand(cos(utphi) )- 
a*cos(u) ,a*expand(sin(u+phi) )-a*sin(u) ,2*c>; 

print (‘surface=',Transpose(Z)); 
plot3d(Z,u=0..2*Pi,v=0..1,scaling=constrained,shading=XY, 
lightmodel=light2) ; 

end: 


> cyltwist(2,5*Pi/4,6); 


Curvatures 


3.1 Introduction 


As we have seen, objects such as the shape operator and the principal curvatures give a great 
deal of information to us about the geometry of a surface. We have also seen, however, that it 
may be difficult or impossible to actually compute these objects precisely. In this chapter we will 
introduce two computable “invariants” of a surface which are associated to the shape operator 
via linear algebra. In the next chapter we will then consider various results which indicate the 
kind of information these objects provide. 

The two most basic linear algebraic invariants associated to a linear transformation are its 
determinant and its trace. Because the shape operator at a point p is a linear transforma- 
tion, we may define two geometric quantities in terms of the shape operator’s determinant and 
trace. 


Definition 3.1.1. Let S denote the shape operator of a surface M. 
The Gauss (or Gaussian) curvature of M at p € M is defined to be K(p) = det(S,). 
The mean curvature of M at p € M is defined to be H(p) = 5 trace(S,). 


These two quantities are fundamental to the study of the geometry of surfaces, as we shall see. 
Moreover, even though K and H are defined in terms of the shape operator, we will see that they 
may be calculated by calculus alone. Therefore, we are led back to our main premise that linear 
algebra is the bridge which allows geometry to be studied via calculus. We have seen that the 
matrix of the shape operator with respect to a basis of principal vectors is given by 


ki(p) 0 
0 kop) }” 
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with determinant and trace equal to k, kz and k; + k, respectively. Hence, 


_ ki tk 


K = kiko, H 5 


Recall that if —U is chosen as a normal instead of U, then k(u) changes sign also. Since Gauss 
curvature is the product of two such changes, it does not change sign. Note, however, that H 
does change sign under a change of unit normal. These observations are important because the 
sign of K has meaning. 

Suppose K(p) > 0. Since K = k,k2, k; and kz must have the same sign. But k; = max k(u) 
and k2 = mink(u), so k(u) has the same sign for all u. If k(u) > 0 for all u then M bends toward 
U in every direction. If k(u) < 0 for all u, then M bends away from U in every direction. 


Exercises 3.1.2. Interpret K(p) < 0 and K(p) = 0. What are the pictures? 


Example 3.1.3. (Bending and Gauss Curvature) 


1. For K(p) > 0, consider the elliptic paraboloid z = x? + y* at p = (0, 0, 0). The paraboloid 
is tangent to the x y-plane at p and, clearly, whichever vertical normal is chosen, either all 
k(u) are positive or all are negative. 


2. For K(p) < 0, consider the hyperbolic paraboloid z = x* — y* at p = (0, 0, 0). Again, the 
tangent plane at p is the x y-plane. This time, however, the surface both bends away and 
towards either chosen normal. 


Exercise 3.1.4. The case K(p) = 0 may arise in two ways exemplified by (1) a plane and (2) a 
cylinder x? + y? = 1. Discuss these in terms of normal curvature. 


Now we will focus on straightforward ways to compute curvature in terms of calculus and 
linear algebra. Let v and w be linearly independent tangent vectors at p € M. This means simply 
that one vector is not a multiple of the other. Since they are linearly independent, v and w forma 
basis for 7,(M) and any vector is a linear combination of them. Hence, we may write the effects 
of the shape operator as 


S(v) = av + bw and S(w) = cv + dw. 


This says that the matrix of S with respect to the basis {v, w} is E ‘| Now det(S) = ad — be = 
K and tr(S) = a + d = 2H by definition, so we obtain 


S(v) x S(w) = (av + bw) x (cv + dw) 
= ac(v x v) + ad(v x w) + bce(w x v) + bd(w x w) 
= 0+ (ad — bc)(v x w) +0 
= det(S)v x w 


=Kvxw, 


3.1. Introduction 109 


S(v) x w+ v x S(w) = (av + bw) x W+4+ Vv x (cv + dw) 
= a(v x w) + d(v x w) 
=(a+d)(v x w) 
= tr(S)v x w 
=2Hvxw. 
Exercise 3.1.5 (Lagrange Identity). Show that for vectors v, w, a and b, 


(v x w)- (a x b) = (v- a)(w- b) — (v- b)(w- a). 
Hint: Write out the vectors in coordinates (v!, v?, v?) = v etc. 
Now combine Exercise 3.1.5 with the formulas above to get 
(S(v) x S(w))- (v x w) = K (v x w)- (v x Ww) 


(S(v) - v)(S(w) - w) — (S(v) - w)(S(W) - v) = K ((v- v)(w- w) — (v- w)(W- v)) 


__ (S(¥) - VSO) - w) — GUY) - WX SOW) +) 


(v- v)(w- w) — (v- w)(w- v) 


K 


and 


(S(v) x w+v x S(w))- (v x w) =2H (v x w)- (v x W) 
= (S(v) - v)(w - w) ~ (S(v) - WOW - v) 
+ (v- v)(S(w) - w) — (v- w)(S(w) - v) 
=2H ((v- v)(w- w) — (v- w\(w- y)). 


Finally, by dividing through, we obtain 


7 CO = OW) wn a) GO) oy Os): 


2 ((v- v)(w- w) — (v- w)(w- ¥)) 
Exercises 3.1.6. Use Euler’s formula (Corollary 2.4.11) to show 


(1) the mean curvature H at a point is the average normal curvature 


1 2a 
He =| k(0) dé. 
2n 0 


(2) H = (k(v,) + &(v2))/2 for any two unit vectors v; and v2 which are perpendicular. Hint: 
If the angle from v; to u; is ¢, then the angle from v2 to u, is@ + 45. 


Exercise 3.1.7. Show that the principal curvatures are given in terms of K and H by 


ki =H+JH?-—K and k=H-WVH?*-K. 
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Hint: k; and kz are the eigenvalues of S. Find these for S = [3 ] by setting the characteristic 
polynomial A? — tr(S)A + det(S) equal to zero and using the quadratic formula to solve. 


There is another more geometric way to view Gauss curvature in terms of the Gauss map 
G: M — S*. We have seen in Chapter 2 that the derivative of the Gauss map is the negative of 
the shape operator, G,(v) = —S(v). Take the basis {x,, x,} and consider 


G,(Xu) x G,(%) = (-—S(x)) x (—S(x,)) 


= KX, X Xp. 


Now, |G.(x,) x G.(X,)| and |x, x x,| may be thought of as infinitesimal pieces of area of 
the image of the Gauss map on S? and M respectively. The formula above then says that the 
ratio of these infinitesimal areas is precisely |K|. Another way to say this is the following. 
Let U/ be a small open neighborhood of p € M and suppose that / is contracting down to p. 
Then 


. AreaG(U) 
|K} = lim ———— 
u>p Areall 


This expression means that the magnitude of Gauss curvature measures the way in which the unit 
normal expands or contracts area. We also may identify the area in the image of the Gauss map 
as follows: 


Proposition 3.1.8. For M: x(u, v), the area of G(M) is equal to the total Gauss curvature of M, 
fy K dA. 


Proof. We compute 


Area (GM) =f IGe(%y) X GalXy)] du dv 
M 
= [ \(—S(ay)) x (—S(x,))] du dv 


a K |x, x Xy| du dv 
M 


= K dA. 
M 


5 


The Gauss map will be of greater importance to us when we study the complex-analytic 
approach to minimal surfaces in Chapter 4. With this in mind, we introduce some convenient 
terminology. A surface M is said to be flat if K(p) = 0 for every p € M and it is said to be 
minimal if H(p) = 0 for every p € M. We will consider minimal surfaces extensively in two 
later chapters and we will see exactly where the term “minimal” comes from. The term “flat” is 
derived from the fact that the prime example of a flat surface is the plane. We have already seen 
that the shape operator of a plane is identically zero, so it is immediate that its determinant, the 
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Gauss curvature, is identically zero as well. It may not be so obvious however that other surfaces 
may be flat. 


Exercise 3.1.9. Use your knowledge of the shape operator for the cylinder x? + y? = R? to show 
that a right circular cylinder is fiat, but not minimal. 


Exercises 3.1.10. Show that if M is minimal, then K <0 on M. Hint: look at principal 
curvatures. 


Exercise 3.1.11. Show that the R-sphere S?(R) has K = 1/R?, first by considering the deter- 
minant of the R-sphere’s shape operator and, secondly by considering the Gauss map of the 
R-sphere and how area changes. 


3.2 Calculating Curvature 


The formulae for K and H for general v and w may be particularized to x, and x, when a patch 
x is given for M. With this in mind we introduce the following traditional notation: 


E=Xy-Xu, FHxX,-Xp, G=X,)-X, 
E = S(Xy)- Xu, m = S(Xy)-Xy = Sy) + Xu, n = S(Xy) + Xp. 
Then, replacing the general v, w by x,, X,, we have the two curvature formulas 


= (S(xu) : Xu )(S(Xy) Z X) _ (S(x,) : Xy)(S(Xy) : X,) 


K= 
(Xu * Xu )(Xy + Xv) — (Ka * Xv)(Xy + Xv) 
— In-m 
~ EG — F? 


= (S(Xu) > Xu)(Xy + Xv) — (S(Kx) + Xu (Xp * Ku) + (Ky + Xu )(S(Ky) + Xv) — Ke + Xv (SOY) +» Xu) 
2 ((Xu + Xu)(Xy + Xv) — (Ky + Xy)(Xv + Xu) 


fel 


im Gl+En-—2Fm 
"(EG — F2) 


Remark 3.2.1. The quantity EG — F? has already made an appearance as |x, x x,|*. See 
Exercise 1.3.5 (Lagrange’s identity). 


Now we come to the fundamental result which allows for computation of K and H without 
reference to the shape operator S. The result follows immediately from Theorem 2.3.5. 


Lemma 3.2.2. /=U-x,,, m=U-Xy, n=U-Xpy. 


Exercise 3.2.3. Prove the formulas in Lemma 3.2.2. 
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Exercise 3.2.4. Suppose x(u, v) and y(r, s) are two patches for the same surface M. For y"! 0 
x(u, v) = (r(u, v), s(u, v)), show that 


ExGy — Fy = [ruSy — rySu] (EyGy — Fy) 
Iynx — m. = [TuSv — rSu](lyny = my) 
and, consequently, Kx, = Ky. 


Exercise 3.2.5. Suppose x(u, v) is a patch. Define a new patch by y(u, v) = c x(u, v), where c is 
a constant. Show that 


Now specialize to the case where x(u, v) is the standard parametrization of an R-sphere. Does 
your answer make sense? 


Exercises 3.2.6. From a surface M: x(u,v), construct a parallel surface M': x'(u,v)= 
x(u, v) +t U(u, v). Show that the Gauss and mean curvatures are given by the formulas 


K H—Kt 


————————r | 2 ae nas Ris 
1—2Ht+ Kt? 1—2Ht+ Kt? 


where K and H are the Gauss and mean curvatures of M. For simplicity, assume F = 0 and use 
the standard formulas for U, and U,. After verifying the formulas above, show the following: 
(1) if M has constant mean curvature H = c #0, then M!/@°) has constant Gauss curvature 
K'/@c) = 4c?, (2) if M has constant Gauss curvature K = c* > 0 (and no umbilic points), 
then M'/¢ has constant mean curvature H'/° = —$. Hint: compute E'G' ~ FO = |x x xt? = 
(1 —2Ht+ Kt?)(EG — F*) directly from the definition of x'(u, v) and compare. We will use 
parallel surfaces when we discuss minimal and constant mean curvature surfaces in Chapter 4. 


Exercise 3.2.7. Let M: x(u, v) be a surface. Recall that a curvea: J > M isa line of curvature 
if the tangent vector a’(r) is an eigenvector of the shape operator for each t. Show that all u 
and v-parameter curves (in a non-umbilic region) are lines of curvature if and only if F = 0 and 
m = 0. Hints: for one direction compute x, - x, two ways using the lines of curvature hypotheses. 
Show that either M is all umbilic (hence, is a sphere) or F = x, - X, = (E/1)m = 0. For the other 
direction, write S(x,) = ax, + bx,, S(x,) = cx, + dx, and compute the coefficients by taking 
dot products with x, and x,. 


Now let’s concentrate on computing Gauss and mean curvature for various surfaces we have 
met previously. We begin with 


Example 3.2.8 (Enneper’s Surface Curvatures). 
Let M denote Enneper’s surface with the standard parametrization 


uw v 
x(u,v) = (u- Sm vist uu’, eaa?), 
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From the parametrization, we can compute the metric coefficients. First, we require x, = (1 — 
u? + v*, 2uv, 2u), x, = (2uv, | — v? + u?, —2v) and we then have 


E=Xy + Xy 
= 142u?+2v* + u4 + 2u?v? + v4 
=(1tu?+0°y, 


F=X,-X, 
= 2uv — 2u>v + 2uv? + Quv — 2uv? + 2u3v — 4uv 
= 0, 


G=Xy'Xy 

=4u°v? +1—-yv4w—v tut vw tu? — wv? +4 +40? 
=142u? +2v* +u4 + 2u7v? + v4 
=(1+u? +0’). 

The unit normal U is found by taking 

XX Xy = (—4uv? — 2u + 2uv? — 2u3, 2v — 2u?v + 2v? + 4u7v, 1 — (u? — vv?’ — 4u?v?) 
= (—2u(1 + u? + v’), 20(1 +? + v?), 1 — (v? + v’)*) 
with length squared 
ix, x xp? =4u2(l+u2 ++ 4rd tute’ tty —-20 4+ vy +1 

= Au? +7) + 8? +’? +40? +? 4 (W? + *)f — AW? +?) +1 
=(u? +’) + 4a? + vo’) + 6(u? +??? + 4? +02) +1 
= (u? ty 1) 


which also may be computed by noting the consequence of the Lagrange identity, |x, x x,| = 


VEG — F?. Then, 


—2u 2v 1-(’ +’) 
U= ; ; : 
w+vet)l wt+vu2til (uet+tvu24+1P 
The second partials x,,, = (—2u, 2v, 2), Xy» = (2u, 2u, 0) and x,, = (2u, —2v, —2) then give 
L=x,,-U 
4u2 4y? 2—2(u? +’? 


peed “yea ed (u2 + v2 + 1)2 
— ue +u? +19 

(v2 + v2 +1) 
= 2. 
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Similarly n = x,, -U = —2 and, clearly, m = x,,-U =0. Then the Gauss curvature of En- 
neper’s surface is 


K- In — m? - —4 
~ EG-—F? (u24+024+1)4 
and the mean curvature is 
oe Gl+En—2Fm 
"(EG — F?) 


_ w+ v2 +12) + (2 + v? +:1°(-2) - 0 
i 2(u? + v? + 174 


= 0. 


Hence, Enneper’s surface is a minimal surface. We will see exactly how its parametrization arises 
in Chapter 4. 


Exercise 3.2.9. Show that the u and v parameter curves of the catenoid and Enneper’s sur- 
face are lines of curvature. Also, show that these curves are planar (Hint: calculate torsion). 
Finally, show that the following surface also has parameter curves which are planar lines of 
curvature: 


_ (cu tsinucoshvy vtccosusinhy 
aac JI-c V1—c? 


Hint: use Maple. Enneper’s surface, the catenoid and this last surface are the only minimal 
surfaces with planar lines of curvature. 


, +cosu cosh ») : 


Exercise 3.2.10. Using K = (In — m*)/(EG — F*), show that the Gauss curvature of the R- 
sphere S*(R) is 1/R?. 


Example 3.2.11 (Hyperboloid of Two Sheets Curvature). 
Let M denote the hyperboloid of two sheets 


x? y? 2 


ap 
parametrized by x(u, v) = (a sinhu cosv, b sinhusinv, c coshuz). Then 


xX, = (acoshu cos v, bcoshu sin v, c sinhu) 


Xy = (—asinhu sin v, b sinhu cos v, 0) 


and x, xX X, =(—bc sinh’ u cos v, —ac sinh? sin v, ab sinh u cosh u). Dividing by |x, x x, 
gives 


Xy X Xp 
U = —_ 
Ww 
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where 
W = Vb2c? sinh? u cos? v + a2c? sinh’ sin? v + a2b? sinh? u cosh? u. 
We then have 
E =a? cosh? ucos* v + b’ cosh’ u sin* v + c* sinh’ u, 
F = —a’ sinhucoshu sin v cos v + b* sinhu cosh u sin v cos v 
G =a? sinh? u sin* v + b* sinh? u cos? v 
with 


EG — F* = bc’ sinh* ucos? v + a’c’ sinh’ sin? v + a*b’ sinh? u cosh? u 
= W? 
The following second partials then give /, m and n. 
Xyu = (asinhucos v, bsinhu sin v, c cosh u) 
Xuv = (—acoshu sin v, bcoshu cos v, 0) 


Xyy = (—a sinhu cos v, —bsinhu sin v, 0); 


l=x,,°U 
_ abe sinh? u cos? v — abc sinh’ u sin? v + abc sinhu cosh? u 
~ WwW 
bc sinh 
= — using 1+ sinh? u = cosh? u, 


mM=Xy-U 

abc sinh? u coshu sin v cos v — abc sinh* u cosh u sin v cos v 
Ww 

= 0, 


n=Xy-U 


abc sinh? u cos? v + abc sinh? u sin? v 


Ww 
abc sinh’ u 
= 7 : 
Hence, we obtain the Gauss curvature 
In—m? — abc? sinh* u 
he Se 
EG — F? w4 


which we may write as 


w2 \? 
k=1/ (Gam) 
abc sinh* u 
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where 
w be. ac, : ab 
a a sinh u cos? v + — sinh? u sin? v + — cosh’ uw. 
abcsinh’u =a b c 


Now, the coordinate functions of the parametrization are x = asinhucosv, y = bsinhu sinv 
and z = ccoshy, so the reader can check that the Gauss curvature may be written in terms of x, 
y and z as 


1 


K= ae 
2p22;e2 4 Pa 2 
BORE! | ae 


Exercise 3.2.12. Show that the Gauss curvature of the hyperboloid of one sheet x(u, v) = 
(acoshucosv, bcoshusinv, csinhu) may be written in Cartesian coordinates as 


1 
| a a 
woo[e+e+8] 


Hint: Maple may be useful here. 


Exercise 3.2.13. Show that the Gauss curvature of the elliptic paraboloid 


may be written in cartesian coordinates as 


l 
K= 


7 4q2p2 [= ae y ae ie 
a4 bf 4 


Exercise 3.2.14. Show that the Gauss curvatures of the hyperbolic paraboloid 


may be written in cartesian coordinates as 


K= 


a 
2 2 1 
4a*h? [s + a 7 1 


Exercise 3.2.15. Find the Gauss and mean curvatures of the helicoid given by x(u, v) = 
(ucos v, usin v, bv). 


Exercise 3.2.16. Find the Gauss and mean curvatures for the saddle surface M: z = xy. Take a 
Monge patch x(u, v) = (u, v, uv). 


Exercise 3.2.17. Compute the Gauss curvature of Kuen Surface 


2(cosu +usinu)sinv 2(sinu —ucosu)sinv v 2cosv 
x(u, v) = - ‘ - : In(tan ) + : : 
1+ u? sin’ v 1+u? sin? v 2 1+ u? sin? v 


3.2. Calculating Curvature 117 


Exercises 3.2.18. This exercise deals with Gauss curvature of ruled surfaces. 
(a) Show that a ruled surface x(u, v) = B(u) + vd(u) has Gauss curvature 


m2 _ —(p’ -8x 8’)? 


K = —-— —_¥ = 
EG — F? wt 


where W = |f’ x 6 + vd’ x 4]. Using ruling patches, 
(b) Compute K for M: z=xy. 
(c) Compute K for a cone x(u, v) = p + vd(u). 
(d) Compute K fora cylinder x(u, v) = B(u) + vq. 
(e) Compute K for the helicoid. 
(f) Compute K for a hyperboloid of one sheet. 
(g) Compute K for the hyperbolic paraboloid. 


Interpret your computations in terms of the geometry of the surface. Compare your calcula- 
tions with previous ones. Hint: recall that EG — F? = |x, x x,|* by using Lagrange’s identity 
Exercise 1.3.5. 


Exercise 3.2.19. A cone (minus its vertex) is a surface. For the standard cone z = /x? + y?, 
compute K and H using the two patches: 


(a) x(u, v) = (u, v, Vu* + v2) (i.e., a Monge patch). 


(b) x(u, v) = v(cosu, sinu, 1) (i-e., a ruling patch). 


Exercises 3.2.20. A ruled surface M: x(u, v) = B(u) + vd(u) is developable if its unit normal 
U is constant along the rulings. Hence, U does not depend on v, so U, = 0. Show that a ruled 
surface is developable if and only if its Gauss curvature is zero. (In fact, this is true even without 
the assumption “ruled”.) See Section 5.7 for an industrial application. Hint: the key is to look at 
m. Consider (x, - U), for one direction and (x, - U), for the other. 


Exercise 3.2.21. Show directly that cones and cylinders have constant normals along rulings 
and, therefore, are developable. 


Exercise 3.2.22. Besides cones and cylinders, typical examples of developable surfaces are ones 
which arise from curves, the tangent developables. Given a curve B(u), its tangent developable 
is defined to be the surface parametrized by x(u, v) = B(u) + vB’(u). Show that this surface is, 
in fact, developable. Graph the tangent developable of the helix B(u) = (cosu, sinu, u). See 
Figure 3.1 and Figure 2.24 on p. 103. 


Exercise 3.2.23. Let 6B: 1 — M be a curve on a surface M with unit normal U. Show that 
B is a line of curvature on M if and only if the surface defined by y(u, v) = B(u) + v U(u) is 
developable. Here, U(u) denotes the normal of M along B. 
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Figure 3.1. (Half of) tangent developable of helix 


Exercise 3.2.24, Let M be a developable surface with no umbilic points and suppose @ 1s a unit 
speed line of curvature on M corresponding to the nonzero principal curvature. Show that a is 
perpendicular to each ruling line. Hint: show 6 - U = 0 and differentiate. 


Exercise 3.2.25. Let @ and M be the respective line of curvature and developable surface 
in Exercise 3.2.24. Now parametrize M by x(s,f) = a(s) +f B(s) with |A(s)| = 1. Show that 
B'(s) = A(s)a'(s) for some function A(s). 


Exercise# 3.2.26. For M: z= f(x, y): 
(a) Give a Monge patch for this surface. 
(b) Compute E, F, G, 1, m,n. 

(c) Find formulas for K and H, 


(d) Recall that a critical point for f is a point (ug, Uo) which satisfies f(g, vo) = 0 and 
J.(uo, Yo) = 9. The point (uo, vo) may be a maximum, a minimum or a saddle point. The 
2™_Derivative Test is designed to decide which of these choices pertains. It goes like this. 
First, compute D = fy,(uo. Vo) frv(to. Yo) — (fur(Uo. Yo)? Lf D = 0, then the test fails 
and no information is obtained. [f D < 0, then (uo. vo) is a saddle point. If D > 0, then 
there are two cases: 

(1) If fixy(do, Yo) > 0, then (Ug, vy) is a Minimum. 
(ID) If fiu(to, vo) < 0, then (up, vo) is a maximum. 


(In the above, /,, could equally well be used.) 
Show that this test works! Hints: (1) How is K related to D? (2) Consider K > 0, K < 
0, K =0. (3) What do these imply about normal and principal curvatures? (4) Recall 
k(v) = S(v)- v for v a unit vector. If w is not a unit vector, then it can be made one by 
w/|w|. Thus 
S(w) w= (S(w/|w|) ow/|wi)lw)? Why? 
= k(w/|wi)iwl?, 


(5) fuu(uo, Vo) = U (ug, Vo) Xuu(Uo, Vo). Where (ug, Up) is a critical point. 
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Figure 3.2. A surface of revolution 


3.3 Surfaces of Revolution 


We have already seen that many interesting examples of surfaces arise from revolving 
curves about an axis. These surfaces of revolution (see Figure 3.2) have patches of the form 
x(u. v) = (g(u), A(u) cos v, h(u)sinv). The curves on the surface which are circles formed 
by revolving a single point about the axis are called parallels (denoted 2). The curves 
which look exactly like the original curve (but rotated out) are called meridians (denoted 
jt). Also, recall that g() is the distance along the axis of revolution and (uw) is the radius 
of a parallel. We have, x, =(g’. hk’ cos v.h'sinv), x, = (0, —Asinv, Acosv) and x, x Xx, = 
(hh', —g’hcosv, —g’hsinv). Hence, 


v= (h', —g' cos v, ~g' sinv) 
Jez" 4 hit 


Also, the second partial derivatives are x,,,=(g2",h’cosv,h" sinv), xX,» =(0,—A’sinv, 
h' cos v), and x,, = (0, —A cos v, —h sin v). Hence, 


E=g" +h". F=0. G=h’, 


= (ath! —h's’) —eE} m=0 n= —— 


Finally, the Gauss curvature is computed to be 


I 


x a Sten he). 
h(g’ +h) 


Exercise 3.3.1. Verify the computations above. 


If the original curve a(/) = (g(1), A(z), 0) has g'(¢) + 0 for all 7, then g is strictly increasing. 
Thus, it is one-to-one and has an inverse function g~' which is differentiable as well. Such an 
inverse allows us to reparametrize the curve a. Define f = 4 o g™! and get 


@u)=awog '(u) = (gg '(u), hem '(u), 0) 
=(u, f(u). 0). 
Thus, our calculations become somewhat easier. For instance, the formula for Gauss curvature 


becomes K = —f"/ f(1 + i ). To avoid confusion, we will still write a(u) = (u, h(u), 0) and 
K =—h"/hL +h? yp. 
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Figure 3.3. A torus 


Example 3.3.2 (Torus Curvature). 


x(u, v) = ((R +rcosu)cosv, (R+rcosy)sinv, rsinu) 


hiu) glu) 
cosu 
~ 7(R+rcosu)’ 
Observe that, for = <ut< cm we have cosu > 0, Hence K > 0 on the outer half of the torus. 
For u = ——, = we have cosu = 0, so K = 0 on the top and bottom of the torus. Finally, for 
7 In 


5 <u < = wehavecosu <0 and K < 0 on the inner half of the torus. Note that the maximum 
K occurs at u = 0, 


KR}. = —_—_— 
r(R +r) 
on the outermost circle, while the minimum K occurs at u = 7, 


1 
K(x) = -———— 
= Ra 
on the innermost circle. 
Exercise 3.3.3. Verify the formula for K above. 


Exercise* 3.3.4. For a surface of revolution parametrized as above, the v-parameter curves 
are parallels and the u-parameter curves are meridians. The normal curvatures in the directions 
ofx, and x, are denoted k, and k,, respectively. Show that k, = //E andk, = /G by calculating 
the eigenvectors of § as x, and x, with eigenvalues //E, n/G respectively. This shows directly 
that meridians and parallels are lines of curvature. Hint: S(x,) = ax, + bxy. Use F = 0, m = 0. 


Exercise 3.3.5, For a general surface of revolution, show that 


, 


= gn h' = gh" 3 g 
— (2? +h? pi" aT h(g” + h’*)'/2 

and obtain formulas for K and H. What is k, on a torus? What is the geometric reason for 
this? 


“ 


Exercise 3.3.6. Revolve the catenary y = ¢ cosh(*) around the x-axis to obtain the catenoid. 
Show the following: 


i==>—-> bark. Ks 


=. H=0. 
e cosh” . 


c? cosh* & 
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Figure 3.4. Tangent line segment of length c 


Can you think of another surface of revolution which is minimal? 


Exercise 3.3.7. For the following surfaces of revolution, find K and describe where K > 0, 
K <0,K =0. 


(a) Revolve a(u) = (u, e~“/?, 0) about the x-axis. 


foe et RY 
(b) Elliptical Torus. Revolve the ellipse —_,— + ia 1 about the y-axis. 
a 


Exercise 3.3.8. If a surface of revolution is generated by a unit speed curve a(u) = 
(g(u), h(u), 0), then show (a) E = 1, F =0, G=h? and (b) K = —h"/h. 


Example 3.3.9 (The Pseudosphere). 
As another example of our computational techniques, we will consider a surface of revolution 
which is analogous to the sphere in the sense that it has constant Gauss curvature. This surface 
is called the Pseudosphere and it is defined by means of a condition on the generating curve. 
Namely, let a be a curve which begins at (0, c) and traces a path so that its tangent line at any 
point reaches the x-axis after running a distance exactly equal to c. This means the curve a must 
decrease and flatten. The curve a@ is called a tractrix and is physically represented by the path a 
ship would follow when starting at (0, c) and is being pulled by a tugboat which moves along the 
x-axis (see Figure 3.4). 

Write a(u) = (u, h(u)) and note h’ < 0 and g(u) = u. Now, a’(u) = (1, h’(u)) and a tangent 
line at (u, h(u)) has the equation 


I(t) = a(u) + ta'(u). 


The y-coordinate of [(t) is then h + th’, so for t = 0 we are at a(u) and fort = —h/h’ we are on 
the x-axis. We want the length of this segment to be c, so we have 


h 
c = |a(u) — (a(u) — pit wl 


A a'(u) 
= —la’(u 

1 

h 

— V1 +h”. 


~ Th" 


122 3. Curvatures 


Then c* = Pal a h”) = +h?. Solving for h’ and then taking another derivative, we get 


hy’ h h” h'c? 
i 1 =" _ pau: 
Vc? —h? (ce? — hy? 
Exercise 3.3.10. Show that, for the surface of revolution obtained by revolving @ about the 
x-axis, we obtain (a) k,, = h'/e (b) k; = —1/ch’ and (c) K = —1/c’. 


Note that the Gauss curvature of the pseudosphere is a constant. However, in contrast to a 
sphere’s constant positive curvature, the pseudosphere’s curvature is negative. For « = 1, we 
obtain a surface at each point of which the Gauss curvature is —1. Later we will meet up with 
more geometrically natural surfaces of this kind. 


Exercises 3.3.11. Solve the separable differential equation 
h 
vl-r 


in two ways to get (1) « = In [vee + t|—J1 =F? and (2) A(w) = sech w, u = w — tanhw, 
For the first, turn the picture over so we revolve the tractrix about the z-axis and the pseudosphere 
(with ¢ = 1) lies over the disk of radius 1. If we write things in cylindrical coordinates (r. @, z), 
then the old w becomes z and the old 4 becomes r. By substitution in the formula above, we get 


z=—vVv1—r? —In(r) 4+ In(1 + V1 —r?). 


Of course, rectangular coordinates may now be introduced by r = \/x? + y*. Use this formula 
to graph the pseudosphere via Maple. For the second, we obtain a patch for the pseudosphere 


i’ =- 


XQ, v) = (w — tanh w, sech w cos v. sech w sin v). 
Plot this patch with Maple also. Compare the two plots. Hints: For the first, use trigonometric 


substitution with 4 = cos w. For the second, use hyperbolic trigonometric substitution. 


The pseudosphere (see Figure 3.5) is actually determined by the requirement of constant 
curvature and certain initial conditions. Suppose we start off by requiring that a surface of 


Figure 3,5. A pseudosphere 
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Figure 3.6. A constant negative curvature surface of revolution 


revolution M have constant Gauss curvature K = —1. Wecan suppose also that M is parametrized 
by x(u, v) = (g(u), h(u) cos v, A(u) sin v) with e(uy? + h'(uy? = | (i.e., the profile curve is unit 
speed). Then, as we have seen, the formula for Gauss curvature reduces to K = —h"/h, so the 
requirement of constant curvature K = —1 gives us a linear differential equation to solve, 


h" =h. 
The solution to this differential equation is easy, We obtain 
h(u)= Ae“+ Be. 


Then, plugging A(u) into the the unit speed relation g'(u)* + h'(u)? = | and solving for g(u) 
gives 


eu) =+ [ V1 —(Ae! — Be) dt. 


In choosing A and B, new surfaces of constant Gauss curvature K = —! are created. See 
Figure 3.6. 
Exercise 3.3.12, Let A= 1 and B =0- Show that this surface of constant Gauss curvature 


K =—| is the pseudosphere with c = |. Hint: let e’ = sech w. 


Exercise 3,3.13, Carry out the same procedure for the case of positive Gauss curvature K = +1. 
What should A and B be to get the unit sphere’? See Exercise 3.7.8 and Section 3.8. 


3.4 A Formula for Gauss Curvature 


So far, we have seen that Gauss curvature tells us a great deal about the geometry of a surface. 
However, since we compute K using the unit normal U, it seems that the precise way in which 
we situate a surface in R? may well change K while leaving the essential geometry unaltered. 
This would mean that Gauss curvature would not be an invariant of the geometry of the surface. 
We shall now give a formula for K which obviates the use of U and shows that K depends only 
on E, F, and G, the so-called merric of the surface. In fact, although a more general formula 
exists (see Exercise 3.4.7 below), we shall restrict ourselves to the case where F = x, -x, = 0. 
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That is, we shall assume that the u and v-parameter curves always meet at right angles. In this 
case we prove 


Theorem 3.4.1. The Gauss curvature depends only on the metric E, F and G. When F = 0, 
Gauss curvature is given by 


1 a E, a Gy, 
* K= + : 
=) 2/EG G (zs) du (5) 
Here, we have used the notation 


E Las a ) d G a 2 ) 
= LE = (ky Xu an w= 779 = 7 (he: Xp). 
"av av.“ du du 


Of course, to prove the theorem, we will show that our usual formula K = (In — m*)/(EG — F?) 
reduces to the one above. Notice that the formula of the theorem does not depend on the particular 
patch used, but only on the metric coefficients E, G (and implicitly F). We will consider this a bit 
more carefully later when we discuss isometries of surfaces, but suffice it to say that, intuitively, 
a surface may be “bent” without stretching and still have the same Gauss curvature. The prime 
example of this phenomenon is the usual right circular cylinder. If the cylinder is cut along a ruling 
it can be made to lie flat on a plane without stretching. This means that, locally, the cylinder 
and the plane have the same geometry. This further explains, geometrically, the vanishing of 
the cylinder’s Gauss curvature. The dependence of K on the metric alone is known as Gauss 
Theorem Egregium (meaning “remarkable theorem”). 

We point out two things. First, this result does not mean that two surfaces with the same Gauss 
curvature are the same geometrically. Surfaces may have the same formulas for Gauss curvature 
while still having fundamentally different geometries encoded in their individual metrics. To 
make this precise we need the notion of isometry, so for the moment we give 


Exercise 3.4.2. Show that the following two patches have the same Gauss curvature: 
x(u, Vv) = (ucosv,usinv, v), y(u, v) = (ucos v, usin v, Inu). 


These surfaces are not isometric as will be shown in Exercise 5.5.2. 


Secondly, the Theorem Egregium holds for Gauss curvature, but not for mean curvature. For 
instance, as we have mentioned, a cylinder and a plane are geometrically the same with the same 
Gauss curvature K = 0, but their mean curvatures are quite different. The mean curvature of the 
plane is zero while that of the cylinder is one-half the curvature of the base circle. Thus, mean 
curvature is not truly an invariant of the surface itself, but depends on the way in which the 
surface sits in R?. 

We now turn to the derivation of the formula in Theorem 3.4.1. While the details of the calcu- 
lations are quite tedious, keep in mind that we are simply finding coefficients for a vector in terms 
of a particular basis. Because / = x, -U,m =X, -U andn = xy, - U, we need expressions for 
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Xyu» Xuy, and xy, in terms of the basis for 3-space {x,, x,, U}. Write 
Xuu = Ty, Xe + T,X +1U (since x,,-U =/ etc.) 
Xyy = PyyXu + yXe + mU 
Xyy =P x, + PY xy +AU. (**) 


Our job is to find the ’’s. These are just coefficients in a basis expansion, but they are known by 
the name Christoffel Symbols. We use what we know about dot products to determine the I"’s. 


Xuu (Xy = Ti ky -X, +040 
=V7 E£ by definition of E. 
If we can compute x,,, - X,, then we will know I'¥,. This is simply the product rule. 
E=x,-X,, SO Ey =Xyy-Xy +Xy°Xuy = 2Xuy * Xy- 


Thus 


E, E 
Xyu Xy = oa and [= Th 


Further, x, - x, = 0 so taking the partial with respect to u gives 
0= Xuu (Xv t Xu + Xyy OF Xyy + Xy = —Xy * Xuv- 


Also, E = x, - X,, So taking the partial with respect to v gives E, = 2x, - x,» and, consequently, 
Ey/2 = Xy * Xyv = —Xuy - Xy. Moreover, 


Vie = Kun Xy)/G = —E,/2G and Vy, =xw-%u/E = Ey/2E. 
Continuing on, G = x, -Xy, So G,/2 = x,y - Xy. Then, since 0 = x, - x,, we have 
—Xy ° Xyv = Xuv + Xy 
with 
Phy = Xuv °Xv/G=G,/2G and TY, =Xw-x,/E = —G,/2E. 
Finally, x, -X, = G, So Xp, -xy = G,/2 and T'},, = x,y -x,)/G = G,/2G. We end up with the 


following 


Formula(s) 3.4.3 (Fundamental Acceleration Formulas: F = 0 Case). 


Eu Ey 
Xuu = yom as 7G +1U 
Ey G, 
Xyy = EM + 9G +mU 
Gy G, 
Xyp = ope + 7G*° +nU 
1 m 
U, => —_ = GY 
m n 
U, = ae GY 
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Exercises 3.4.4. Compute U,, and U, as above. Hint: U, a V,,U = Ax, + Bx,. Find A by 
V,,U -x, = Ax, +X, = AE and0=x,(U -x,) = Vx,U -x, + U + Xyy. 


We know that mixed partial derivatives are equal no matter the order of differentiation, so 
Xyuv = Xuvu> OF Xyuv — Xyvy = 0. This means that the coefficients of x,, x, and U are all zero 
when X,u» — Xyvu 1S written in this basis. Let’s concentrate on the x,-term. (We again use the 
product rule repeatedly.) 


E E E E 
Xuuv = (3) X, + spk a (53) * — sgh + LU +1Uy. 
Now replace xX,,, X,, and U, by their basis expansions to get, 


E,G, E, E,G, In 
Se eed U 
4EG ee) 4G? a |e +0) 


E, E, Gy Gy 
Xuvu = E Xy t+ SS Xuu + Xy + Xv + m,U + mvu, 


Xuuv =[ Ix, +| 


2E 2G 2G 
E,Ey (Gu\ G,G, m? 
= ieee Eyes U. 
me c+] 7a 6 a@ Gi |ettl 


Because the x,-coefficient of x,,» — Xuvy 1S Zero, we get, 


-biGe E, E,G, E,E, G, G,G, In—m 
~ 4EG 2G), 4G?  4EG \2G), 46 G 


Notice the last term! Dividing by E, we have 


 E\2G/), 4EG2' 4E°G E\2G 4EG2" 


in=an? BiG, (2 EG, Eyky: 1G, GyGy 
EG  4E°G E P , 


Of course, the left-hand side is K (since F = 0) and the right-hand side only depends on E and 
G. Thus, we have a formula for K which does not make explicit use of the normal U. 


Exercises 3.4.5. Show that the right-hand side above is 


soa (2 (ha) (Se) 
2VEG \du\ JEG) du \ VE , 
Exercises 3.4.6, Compute the curvature of the sphere of radius R using this formula. Does this 


agree with what the shape operator tells you? 


Exercise 3.4.7. Show, by similar arguments, that, if F 0, the following formula holds. 


OY rsacc eke eee 
_ Gu Ey 
Ge F G & F G 
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Exercise 3.4.8. Show that the U-terms of X,u. — Xuvu = 0 and Xvvu — Xuvy = 0 give 


G, Ey 
lj,—m,=m|—--—-—|+E£,H 
2G 2E 


2E 2G 2E 2G 
and 
BS) «Gy 
ua Ny = Sng are aa G, H 
te m(3 a) + 
Ge ym (Ee _ Ge), Gu 
=/_—+m(— - — — 
2E iE 9G] 3G 


where H is mean curvature (and F is assumed to be zero). These are the Codazzi-Mainardi 
equations. What do they say in the case where the surface is minimal (i.e., H = 0) and the 
parametrization satisfies E = G, F = 0? 


3.5 Some Effects of Curvature(s) 


In this section we will derive some geometric consequences of various conditions on Gauss and 
mean curvatures. Of course, the point of all this is simply to show that we have not wasted our 
time so far in learning how to compute K and H. These quantities really do tell us about the 
geometry of a surface. In the next chapter we will focus on mean curvature and see that it is 
intimately related to certain structures in Nature. For now, however, our goal is simply to see 
geometry reflected in the linear algebra and calculus represented by K and H. 

Let us start by considering a completely impractical situation! Namely, suppose we know 
that every point on a surface is an umbilic point. Recall that this means that, at every point, the 
principal curvatures are equal. Although this does not mean, @ priori, that the principal curvatures 
are constant functions on the surface (with the same value), this is precisely what we shall prove 
below. Of course, a direct verification of such a hypothesis is out of the question, yet we shall 
see later that this result is useful all the same. First try the following 


Exercises 3.5.1. Suppose p € M is an umbilic point (i-e., ky; = kz = k at p). Show that, at p, 


ad n 
SEG 
Moreover, if F #0, then k = % as well. Hint: Write S(x,) = Ax, + Bx, = kx, by Theo- 


rem 2.4.10 since p is umbilic. Use B = 0 in 
[= S(x,)-X,=AE+BF, m=S(x,)-x, =AF+ BG. 


Solve for A. Do the same for S(x,). 


Theorem 3.5.2. A surface M consisting entirely of umbilic points is contained in either a plane 
or a sphere. 
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Proof. Although we know that k,; = k, =k atany p € M, itis still possible that k may vary from 
point to point in M. Now, by Theorem 2.4.10, we see that 

V;x,U = —S(x,) = —kx, and V,x,U = —S(x,) = —kx,. 
Now the mixed covariant derivative may be calculated by Exercise 2.2.9. 


O7uz Ox; OX; Bug Ox; 
Vx, Vx,U = 
Xy YX, d a 2, Ox;0x; du dv = Ox; Oudv 


i j 


= Vx, Vx,U 


so differentiating —kx, by v and —kx, by u gives —k,x, — kX,» = —k,X, — kX», and, hence, 
k,X, = kyX,. But x, and x, are linearly independent, so we must have k, = 0 =k, and this 
implies that k is a constant function. 

Case 1. Suppose k = 0. Then the shape operator is zero everywhere. By Theorem 2.2.17, M 
is contained in a plane. 

Case 2. Suppose k 4 0. Then consider x(u, v) + iU and compute, 


a 1 1 
ou [at v) + iu] =X + pu 
1 
=x, + pace by above 
= Xy — Xy 
In a similar fashion 
: (u,v) + ly 0 
— |x(u,v)+— = 0. 
dv k 
Hence, x(u, v) + ;U = p,aconstant. Then |x(u, v) — p| = | | = rie so every point x(u, v) 


lies a distance 1/|k| from p. Hence, x(u, v) lies on a sphere of radius 1 /|k}. 

We note that we have proved the result only for a single patch. However, because M is 
connected, any patch must overlap some other patch. For a patch which overlaps x(u, v), the 
points in the overlap lie on the plane or sphere as above. Since the argument above produces a 
constant k on the new patch, it must therefore agree with the k of x(u, v). Hence, the geometry 
of x(u, v) propagates to overlapping patches. This then continues over the whole surface, each 
patch reproducing the geometry of its neighbors. O 


Notice that, for an all umbilic surface, by Exercise 3.5.1 we obtain 


n me 2 
Po REG) HE? aE G SE). ne 


~ EG-F2  EG-F?  —— EG- F2  — E? 

so |k| = /K and the radius of the sphere is 1//K. The next result restricts the form surfaces in 
R? may take in terms of their possible Gauss curvatures. Recall that a surface in R® is compact 
if it is closed and bounded. Here, “bounded” means that the surface may be enclosed by a sphere 
of sufficient size. The term “closed” means that any sequence of points on the surface which 
converges to some point in R? actually converges to a point on the surface. In the following result 


In-—m 


K 
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the hypothesis of “compactness” is used to ensure that maximums and minimums of functions 
are actually attained by the functions. 


Theorem 3.5.3. On every compact surface M © I there is some point p with K(p) > 0. 


Proof. Let f: M — R be defined by f(p) = |p|*. This function is continuous, so (since M is 
compact) f attains its max and its min. Let po be a max for f. Now f(po) = | po|?, SO po is the 
farthest point from the origin on M. If r = | po|, then M is inside the sphere of radius r as shown 
in Figure 3.7. 

Therefore, it seems likely that K( po) => 4. where the 4 is the curvature of the r-sphere $?(r), 
We verify this and so prove the theorem. Consider a unit tangent vector u € 7,,(M) and take a 
curve a in M with a(0) = po and a’(0) = u. Of course the composition f o a still has a max at 
Po (since @ lies in M), so we have the usual calculus conditions 


d 

—(fe -9 = 0 

ane a) |,=0 

a 

—(fa -p <0. 

Priel @) lino < 
Now f oa@(t)= Ja(1)/? = a(t) a(t), so 4(f oa)=2a-a’. Atr=0, 

0 =2a(0) - @(0) =2py - uo. 

But u Is any unit vector in 7),,M,so po-u=0 for all u € 7,,M. Hence, py is normal to M at 


po. Also, (d?/dt?)( f oa) = 2a’ - a! + 2a- a” and attr = 0, 


2 


00> — 
~ dpe 


(f o@) = 2a'(0)- a'(0) + 2a(0)-a"(0) 


=2u-ut+2po:u'(0) 
=2+4+2po-a"(0) 
—2>2py-a(0) 
—1 > po-a"(0). 


Figure 3.7. Surface at the farthest point py 
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Now, let’s calculate normal curvatures at po. Since po/r = po/|po| iS a unit normal to M at po, 
the calculations above give 


k(u) = S(u)-u 
= S(a’)-a’ 
= U(po)- a” 
Po 3 a” 


<--. 
In particular, both k; and k» are less than or equal to —1/r. Thus the Gauss curvature has 


1 
jet ee 


] 
K (po) = ki(po)k2(po) = - 5 
ror or 


O 


Corollary 3.5.4. There are no compact surfaces in R? with K < 0. In particular, no minimal 
surface embedded in R? is compact. 


We now come to one of the main results displaying the influence of curvature on geometry. 
Previously, we saw that an all umbilic surface is either a plane (K = 0) ora sphere (K > 0 and 
constant). There is, however, a much stronger result which is based on the Gauss curvature K 
itself rather than its factors k; and kz. This beautiful result is 


Theorem 3.5.5 (H. Liebmann). /f M is a compact surface of constant Gauss curvature K, then 
M is a sphere of radius L/VK. 


Proof. By Theorem 3.5.3, there is some p € M with K(p) > 0. Since K is constant, then K > 0 
everywhere on M. Because K is constant and K = k,k2, where k, and k, are the principal 
curvatures, if p is a point where the function k; is a maximum, then p is also a point where kz is a 
minimum. We know that such a p exists since M is compact and k,, k2 are continuous functions. 
We now have two cases to consider. 

Case 1. Suppose k; = k2 at p. Because k, is a maximum and k2 is a minimum at p, all 
normal curvatures on M must lie between these two values. Therefore, if k; = k at p, the normal 
curvature k must be constant on M. Hence M is all umbilic with K > 0, so it is a sphere of radius 
1/K. (This is what we meant by saying that Theorem 3.5.2 would prove useful.) We shall now 
show that Case 2 below is impossible, so that Case 1 proves the theorem. 

Case 2. Suppose k; > k at p. There is actually a small open neighborhood of p in M where 
k, > kz. Ifnot, we could take a decreasing sequence of open “balls” about p and choose points a; 
in the balls with k;(a@;) = k2(a;) anda; — p.Then, by continuity, we would have k,(p) = ko(p). 
This contradicts our assumption, so we must have the desired neighborhood. Although we shall 
not prove it, on this neighborhood we may choose two principal directions which are perpendicular 
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at each point and use these to provide a patch with x, -x, = E, x, -x, = 0, xy -x, = G and, 
since X,, Xy are principal vectors, 


S(x,) = kix,, S(Xy) = koXy. 


(A proof of the existence of such a patch x may be found in [dC76, p. 185].) Note that m = S(x,) - 
Xy = k,x, -X, = 0 also. Moreover, / = S(x,)-X, = kX, +X, =k, E, sok, = £ and, similarly, 
ky = @. Of course, we have the formula, 


«=-s786 lie (Zee) +a (Zea) 


and from the U-term of x,» — Xuyy = 0 (see Exercise 3.4.8), we get 


0 onra ae Eyl 

~ 9G "OR 

phot n 

"" DIE G 
Ey 

= —(k; +k). 
Bt 2) 


Similarly, using the U-term of Xyy. — Xvuy = 0 we get 


G, [1 n G, 
v= t{l4¢4] = +). 
n [e+e y (ki + ka) 


If we differentiate the formulas k; = £ and k; = 7 repeatedly and use the expressions for /,, and 
n, we get, 


| E, 
kiy = RF bo 23 E,l) = ape = ky) 


] G, 
ky, = Ga lGnu — Gyn) = Yetue —k2) 
kivy = shag — ki) + E,(...) 

2E 


Guu 
Kouy = ——(ky —k G,(...). 
2 xg 2) + Gil...) 
Now, at p, k; is a max and k, is a min, so 


ky = 0 Kivy <0 
ky, = 0 koyu = 0. 


Plugging these estimates into the equations above we get, at p, 


E, =9 Ey = 0 
G, =90 Guu = 9. 
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Note that, even though E, = 0 = G,, at p, the expressions 


(ee) ™ au (Zea) 

av EG ou EG 
may be nonzero at p since the differentiation is done before the evaluation at p. These expressions 
then give us, at p, 


1 
K= —3RG (Ew + Guy) < 0. 


But this contradicts our hypothesis that K > 0, so our assumption k; > k2 must be incorrect. 
Hence Case | holds and M is a sphere. O 


Notice that in the course of the proof we have discovered a famous result of David Hilbert 
(which will prove useful in Exercise 3.5.10). 


Lemma 3.5.6 (Hilbert). /fk, has a max at p, ky has a min at p and k, is strictly greater than ky 
at p, then K(p) < 0. 


So far, we have concentrated on the effects of Gauss curvature on geometry. We now balance 
this somewhat by giving a result which indicates the effect mean curvature has on the geometry 
of a surface. Recall that a surface is minimal if the mean curvature is zero at each point of the 
surface. Just as for other geometric qualities, we may test the notion of minimality on classes 
of surfaces which we understand best. In Chapter 4 we shall consider other types of minimal 
surfaces, but for now let’s look at the surfaces of revolution. 


Theorem 3.5.7. [fa surface of revolution M is minimal, then M is contained in either a plane 
or a catenoid. 


Proof. Consider the profile curve of M, a(u) = (g(u), h(u)). If g(u) is a constant function, then 
M isaplane. If g’(u) # 0 ata point, then it is nonzero in an open interval around that point. In that 
interval then, g(u) has an inverse function, so we could reparametrize the interval of definition so 
that g(u) is replaced by a new variable w and the profile curve becomes a(w) = (w, h(w)). Now, 
we must have one case or the other throughout the domain of M because a’ = h’/g’. Therefore, we 
shall concentrate on this case in what follows. Take a patch x(u, v) = (u, h(u)cosv, h(u)sinv) 
for M. Then 


ya beth 1 eae 1 )-! —hh" +1 +h? 
2 2A HhP PR A$ APIAT  2N A+ h?B2 PO 


The surface M is minimal, so H = 0 and, thus, hh” = 1 +h”. Let w = h’. Then 


,_dwdh dw 


hk’ = coh eed ey ee 
tS die odie. 
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Here we consider w as a function of A on an interval with h’(u) 4 0. We can do this because, 
on such an interval, # has an inverse function f with u = f(h). Then, taking a derivative and 
applying the chain rule gives 

_ dfdh _ df 


1 
=Ghdao dh 


af 
dh 


or w= 
so that w is a function of h since f and df/dh are. Thus, hh” = 1 +h” implies hw (dw/dh) = 
1+ w*; or 
w ] 
——~dw = —dh., 
peut =k 
After integration, we get, 


In(V 1+ w?) =Inh+C 
Vl+w*=ch where c = e°, 
w=tvVc2h2-1. 


Now, w = e, so (dh)/(Vc*h2 — 1) = +du and integration yields (with ch = cosh, | > 0) 


1 
-l=tu+D 
c 

I -1 

— cosh” ‘(ch) = tu+D 

c 


1 
h = —cosh(cu + D). 
c 


Therefore, M is part of a catenoid. O 


Exercise 3.5.8. In the proof above we assumed that h’(u) #0. What happens if h’(u) = 0? 
Explain. 


Exercise 3.5.9. Show that a flat (K = 0) surface of revolution is part of a cone or a cylinder. 
Hint: Model your proof after the theorem above. Then see Exercise 4.9.1. 


Exercise 3.5.10. Show that if M is a compact, connected (oriented) surface with K > 0 and H 
constant, then M is a sphere of radius TE: 

Hints: (1) Consider the function H? — K = (k; — k2)*/4 and let p € M be a max for it. (2) 
If H? — K(p) =0, show K is constant and use Liebmann’s Theorem (Theorem 3.5.5). (3) 
Suppose H* — K(p) > 0 (so ki(p) > ko(p)). Note (k; — k2)(p) is a max. (4) Use Hilbert’s 
lemma, Lemma 3.5.6. 


3.6 Surfaces of Delaunay 


Just as we saw that surfaces of revolution of constant Gauss curvature are determined by certain 
integrals with chosen initial conditions, we can ask whether something similar occurs for constant 
mean curvature. Let’s take a surface of revolution M: x(u, v), where the patch has the form 
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x(u, v) = (u, h(u) cos v, h(u) sin v). The mean curvature is given by 
_ 1 hh" +1+h? 
2 hl +h? yr 
Suppose H = c/2 is constant. We then have the differential equation 
1+h? —hh" = ch(1+h")?. 


First, consider the case when c = 0. The differential equation then reduces to 1 + h’” — hh” = 0. 
This is precisely the equation we solved to get the catenoid. Of course this must be the case since 
the condition c = 0 is saying that M is a surface of revolution with zero mean curvature. Secondly, 
suppose c = +1/a with a > 0. We get (for c = —1/a), 1 +h’ — hh" = =! h(1 +h”)? or 


a(i +h’) — ahh" 


+h=0 
(1 +h): 
2 
a = h” 
oh! a(l+h’) = apcey 
(1+h?): 
2ah'(1 +h’) — 2ahh’h" 
A al Jee 4+2hh' =0 


(1 + h’)3 


d 2ah ae 
du} /i+nh? 
a constant. 


Note that using the integrating factor h’ introduces constant solutions into the situation. If we 
choose to ignore these and take all other cases into consideration (noting that the steps above 
may be reversed), we obtain the general form of the differential equation describing surfaces of 
revolution of constant mean curvature. 


Theorem 3.6.1. 4 surface of revolution M parametrized by x(u, v) = (u, h(u) cos v, h(u) sin v) 
has constant mean curvature if and only if the function h(u) satisfies 


2ah 


vi+h" 


Wt =+p° 


where a and b are constants. 


It is an amazing fact discovered by Delaunay that this differential equation arises geometrically. 
That is to say, there is a geometric construction which produces the differential equation above 
and, consequently, all surfaces of revolution of constant mean curvature. We shall consider one 
example below in detail and then simply give the geometric characterization in a theorem. 


Example 3.6.2 (The Roulette of an Ellipse). 
Suppose that an ellipse rolls without slipping along the x-axis. If we follow the path of one of the 
foci of the ellipse as it rolls, then this path gives an example of a roulette — a curve formed by a 
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A 


Figure 3.8. Ellipse rolling on the x-axis 


Pp’ 


point associated to one curve as it rolls upon another. Consider the “rolling” diagram Figure 3.8, 
where F and F’ are the foci of the ellipse, K is the point of (tangent) contact with the x-axis 
and FT is the tangent to the curve traced out by F. (Note that we use T to stand for a point 
here, but we will also use T for a unit tangent vector to a curve as well. No confusion should 
result.) 

We will use various properties of the ellipse which were, perhaps, better known in years past 
(see [Gra52, Yat74, Zwi63] for example). Let the ellipse have major axis of length a and minor 
axis of length b. Then it is known that the sum of the distances from the foci to a given point on 
the ellipse is constant and equal to 2a. In particular, FK + F’K = 2a. Furthermore, the ellipse 
has the beautiful (and useful) reflection property that any light ray originating from one focus will 
reflect off the ellipse to the other focus. Since the angles of incidence and reflection are equal, this 
means that 9“! ZF K P = ZF’K P’. A more esoteric property of the ellipse (its pedal equation) 
is the fact that the product of the lengths of the line segments from the foci to any tangent line 
which meets the tangent perpendicularly is a constant equal to the square of the minor axis. In 
our case this means that F P - F’P’ = b?. Finally, a general property of this sort of roulette is the 
fact that the direction of the normal to the curve traced by F passes through the point of contact 
with the line. Here, this means that F K is perpendicular to FT. 

Now, the x and y coordinates of F are given by OP and FP respectively. Then y = FK sing. 
Because 7K FT = 2/2, we have 7FT P = 1/2 —8. Moreover, 7FT P is the angle between 
the tangent of the traced curve and the x-axis. That is, for the unit tangent of the traced curve 
Ta(Gap) 

: Xu dx 
sin @ = cos (= -6) =T-(1,0= a 


Hence, y = FK &. Also, since 7F K P = ZF’K P', we have ¥ © F’P’ = F'K 42. Note that 
dx 


2 d 
yt FS(EK PK) — = 20 
ds ds 


by the fundamental property of the ellipse. Also, the pedal equation gives 
y=b. 
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Solving for y in the second equation and plugging 11 into the first equation gives 


4 dx 7 
y — 2Zay— + 0° = 0. 
ds 

Now, the arclength is given by s= | /1 + y? dx, so a =1//1+ y”. Replacing an in the 
equation above and noting that its sign changes when 4 FT K is large enough, we obtain 

° 2ay , 

Pk es + * =f 
vl+y" 

This is precisely the equation we obtained from the assumption of constant mean curvature. There- 
fore, the roulette of an ellipse, an undulary, is a profile curve (i.e., meridian) of a surface of revo- 
lution of constant mean curvature, an unduloid (see Figure 3.9), We note first that, for this case, 
we must have a > b. Also, we note that the companion equation y* + 2ay//1+ y*—b? =0 
is the differential equation describing the roulette of (the focus of) a hyperbola, a nodary. 


Exercise 3.6.3. What is the roulette obtained when b = a? Hint: think geometrically. 


Exercise 3.6.4. What is the roulette obtained when b = 0”? Hint: solve the differential equation 
and then interpret geometrically. 


Exercise 3.6.5. Find the roulette of the focus of a parabola as it is rolled along the x-axis. 


Exercise 3.6.6. Show that the differential equation y* — 2ay/y1 + y’* + b? = 0 may be put in 
the form 


dy /aazy2?—(y2 +h 


dx yt+bh 


-3 


Figure 3.9. An unduloid 
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A surface M is immersed in R? if there is a mapping f: M — R? which may not be one-to- 
one, but which has a one-to-one derivative map f, at every point. Also, although we shall treat 
the idea of completeness in Chapter 5, we mention here that one property of a complete surface 
is that it is not part of some larger surface. Finally, recall that the word “conic” refers to any of 
circles, lines, parabolas, ellipses and hyperbolas. While we have shown that roulettes of conics 
are constant mean curvature surfaces, we now state without proof the more difficult theorem of 
Delaunay. 


Theorem 3.6.7 (Delaunay). A complete immersed surface of revolution of constant mean curva- 
ture is a roulette of a conic. 


More comprehensive discussions of the surfaces of Delaunay may be found in [Eel87, Eel78] 
(which we have mostly followed above) and, of course, [Del41]. When we talk more about 
surfaces with constant mean curvature in Chapter 4, we shall see that they come about as the 
result of a variational principle (again, see [Eel87, Eel78]). That is, informally, the surface takes 
its shape to “minimize” surface area subject to having fixed volume. This principle has both 
physical and biological consequences. In particular, if a soap bubble is formed between two rings 
(with the ends included), then it holds a certain amount of air inside. If the amount of air is just 
right, then the bubble is a cylinder, the surface of Delaunay associated to the roulette of a circle. 
If the amount of air is changed or if the rings are pulled apart to lengthen the bubble, then the 
waist of the bubble narrows and an unduloid results. These types of experiments are discussed in 
D’Arcy Wentworth Thompson’s book On Growth and Form [Tho92]. In Chapter V of this classic 
text, Thompson describes informally how surface tension and pressure combine to determine the 
shapes of various one-celled creatures — shapes suspiciously Delaunayan! 


3.7 Elliptic Functions, Maple and Geometry 


Many of the integrals which arise in differential geometry cannot be computed in terms of 
elementary functions. But just because we cannot find answers that are familiar to us from 
freshman calculus, we should not believe that these integrals are beyond our understanding. One 
tool that is particularly useful in analyzing such integrals is the subject of elliptic functions. The 
easiest way to understand elliptic functions is to consider them as analogues of the ordinary 
trigonometric functions (see [Gre92]). From calculus, we know that 


arcesin(x) i. ae 
Intx) = oF 

0 Vl -u2 
Of course, if x = sin(t)(—2/2 < t < 2/2), then we have 


sin(t) du 


J1—u 


t = arcsin(sin(t)) = 


In this way, we may view sin(t) as an inverse function for the integral. Now, fixing some k with 
0 <k <1 (called the modulus), we make the 
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Definition 3.7.1. The Jacobi sine function sn(u, k) is the inverse function of the integral 


sn(u,k) dt 
u= —=—==—————. 3.7.1) 
0 V1 —tvV1 — kt? 
More generally, we write 
F(z, k) [ Z (3.7.2) 
Zzh)=f ——>—, ak 
0 V¥1l—Pv1 — kr? 


and call F(z, k) an elliptic integral of the first kind. An elliptic integral of the second kind is 
defined by 


z /| — k2¢2 
E(z,k)= / —— dt. 
0 Vvl—? 
When z = 1 in F(z, k) and E(z, k), then these integrals are respectively denoted by K(k) and 
E(k) and called the complete elliptic integrals of the first and second kind. 
The Jacobi cosine function cn(u, k) may be defined in terms of sn(u, k); 
2 2 = 
sn*(u, k) + cn*(u, k) = 1. 
A third Jacobi elliptic function dn(u, k) is defined by the equation 
dn*(u, k) + k? sn*(u, k) = 1. 


The integral definition of sn(u, k) makes it clear that, sn(u, 0) = sin(u). Of course, cn(u, 0) = 
cos(z) as well. 


Remark 3.7.2. If we make the substitution t = sin(@) in the definitions of F(z, k) and E(z, k), 
then we get the following trigonometric forms of the elliptic integrals: 


‘ do : 
Fiw,k)j)= —_———., EW, k= 1 — k? sin?(@) dd, 
Ww.) = [° — a | [ sin’(G) do 


where y = arcsin(z). In particular, when z = sn(u, k), we get sin(y) = sn(u, k) and F(y, k) =u. 


The derivatives of the elliptic functions can be found from the definitions (but also see [Mey01] 
where the elliptic functions are essentially defined in terms of their derivatives!). For instance, 
let’s compute the derivative of sn(u, k). Suppose in (3.7.2) that z = z(u). Then 


dF = dF dz 1 dz 


du dzdu-s JU —~pJl —-keedu 
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But, from (3.7.1), we know that, when z = sn(u, k), we have F(z, k) = u. Hence, replacing z by 
sn(u, k) and using du/du = 1, we obtain 

d sn(u, k) 


1 
J/l—snu, 2/1 —esntu,ke du 
dsn(u, k 

EG) oe, pena E/E ae 


1 


du 
k 
d sn(u, k) = cn(u, k) dn(u, k). 
du 
We also have 
wk d dn(u, k 
con ) = —sn(u, k) dn(u, k), me ) = —ksn(u, k)on(u, k). (3.7.3) 
u Uu 


Maple has built-in elliptic functions, so we can easily plot them as follows. We take k = 1//2 
for concreteness. 


> with(plots): 


We will need to have a numerical value for K(1//2). We can find this as follows using the fact 
that the complete elliptic integral arises when z = sn(u, k) = 1. 


> k1:=1/sqrt(2) ; 


> fsolve(JacobiSN(u,k1)=1,u); 
1.854074677 


This computation can also be made using Maple’s “EllipticK” command. The plots of the elliptic 
sine sn, cosine cn and dn may be seen in Figure 3.10. 


> plot ({JacobiSN(u,k1) ,JacobiCN(u,k1), JacobiDN(u,k1)}, 
u=-1.854074677 . .3*1.854074677) ; 


We see from Figure 3.10 that sn(u, k) and cn(u, k) are periodic, but what is the period? We can 
determine the period by recalling (3.7.1) 


sn(u.k) dt 


0 V1 — 02/1 — k212 


k= 


and 


! dt 
k= ———__—_—_———.. 
ae I V1 —t2/1 — kr? 
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Figure 3.10. Elliptic sine, cosine and dn 


We see that, from the first equality, sn( K(k), k) = 1. Clearly, from the graph, we find that K(k) is 
1/4 of the period of sn(u, &). Of course, this can be verified analytically, but this argument suffices 
for our purposes. Also, the identity, sn’(u, k) + cn?(u, &) = | implies that cn(u, k) has the same 
period as sn(u, k) and that cn(K(k), k) = 0. So, the value of the complete elliptic integral K(k) 
can be found also by calculating where the elliptic cosine vanishes. 


Example 3.7.3 (The Arclength of an Ellipse). 
We parametrize an ellipse by a(t) = (asint, bcost), whereO < t < 2m anda > b. Thearclength 
integral is then 


2n 
Lo= [ V2 + y2 dt 
0 


n/2 
4 [ Va? cos*(t) + b? sin2(t) dt 
0 


2n 
4a i, V1 —e? sin2(t) dr, 
0 


in which ¢ = ./a* — b*/a is the eccentricity of the ellipse. If we substitute u = sinr with 
du/J\ — u2 = dt, we then obtain 


1 [VS 2,2 
L=4a f ee dado BG): 
0 vl—-uv 


So we see that elliptic integrals arise in even the simplest of geometric problems. 
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Exercise 3.7.4. The lemniscate r? = a* cos(20) was another of the original motivations for the 
development of elliptic functions. Show that the parametrization 


a(t) = (a(cost) /cos(2t), a(sint) /cos(2r)) 


can be used to compute the arclength of the lemniscate as 4a(K (1//2)/ J/2). 
We will require the following result in our discussion of the unduloid (see [MO03a]). 


Proposition 3.7.5. The following identity holds.’ 


u _ p2 
i en*(a, k)dii = eh es EF (satu, k), k), (3.7.4) 


Proof. We compute from the definitions. 


E(sn(u, k), k) 1-k 

eae —- 2 F(sn(u, k), k) 
2 = l V1 — kt? el — k?) I dt 
Tdi Je ie Re JO -P)\— ) 


sn(u,k) Le k2 7? = (a -_ k*) 
dt 
i k2/1 — 22/1 — ket? 
sn{u,k) ig t2 
——— dt. 
0 V1 — k?t2 
Now make the substitution tf = sn(#, k) with dt = cn(#, k)dn(a, k) da and note that the limits of 


integration transform as follows. Whent = 0, we have sn(a#, k) = 0,soa# = 0. Whent = sn(u, k), 
then sn(#, k) = sn(u, k), so & = u. Hence, we obtain 


E(sn(u,k),k) 1 -K? 7 [ cn(ii, k)en(ii, k)dn(i, k) di 
a yee Su = = ane 


—.— F(sn(u, k), k) dn(i, k) 


[ en’ (ii, k) di. 


0 
oO 


From Exercise 3.6.6 we know that surfaces of Delaunay (or rather, their associated profile 
curves) are solutions to the differential equation 


dx = 
/4a? y2 —(y2+ bY 


(3.7.5) 


' Although F(sn(ug, k), k) = ug, we shall continue to write F(sn(ug, k), k) because simplification in Maple appears to 
work better in that case. 
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We have seen previously that the profile curves of surfaces of Delaunay may be characterized 
as those curves arising from rolling conics on a line; that is, the roulettes of conics. The case 
above corresponds to the roulette of an ellipse with axes a and b, a > b. These types of surfaces 
of Delaunay are called unduloids. The radicand in (3.7.5) is real just when y lies in the interval 
(a, B) with 


a=a-—VvJa’*—b?, B=a4 Va? —B?. 


Introducing the eccentricity ¢ of the ellipse, « = ,/1 — aa the endpoints of the interval can 
be rewritten in the form a@ = a(] ~—e), 8 = a(1+¢). This leads us to change variables as 
follows: 


y=aV/l+e?4+2esing, @ € [-7/2, 2/2]. 


Performing this change one gets 


ae cos¢dd 


Jl +e? + 2esing 


dy > y? +b? > 2a?(1 +e sing), 


and 


V4a2y? —(y +P? > 2ea* cos @. 


As a result, the integral in which we are interested becomes 


; (l+esing)dd 


x(p) =a — 
Jl+e242esing 


—n{2 


and can be split further into two types of integrals which themselves can be transformed into 
elliptic integrals. 


Lemma 3.7.6. For k = Flere let K = K(k) and E(k) denote the complete elliptic integrals of 
the first and second kind respectively, we have 


: A aia, k), k)) 
-x2J/p+qsing VP+4 -~ 
e sing do 2./pt+q 2p 
a = E(k) + E(sn(u, k), K)) + AK + F(sn(u, &), &)). 
Loa Te egeh gE Ei A) + FPR + at BH) 


Here, u corresponds to @ under the transformation sing = 1 — 2 sn°(a, k). 


Proof, We make the substitution sing = 1 — 2sn?(a#,k) with dé = —2 dn(a&) di (where, as 
usual, we shorten the notation for ther elliptic functions for fixed k). Note that 6 = —1/2 
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corresponds to —K under the substitution. 


‘s do _ f% —2dn(%) di 
-x2J/pt+qsing J-x VP +4 dn(i) 
__~2 / ‘ 
— SP+q Sx 
—2 
= Sree + F(sn(u, k), k)), 


where, again we use F(sn(u, k), k) =u. For the second type of integral, we make the same 
substitution to obtain 


_sing dd _ == : rn ie sn’(i) di 
= — u u u 
-rj2/p+qsing VP+4 J-xK VP+@ J-K 


=) 4 4 : 
ees ee ee oe 2%) dit 
Wea TUS ag ) = | tw ii 
0 u 
-Sk + »- sae (f omwars f en'(i) di) 
2 4 sae ee Be Pacaslae ait 
= eg K+ »-——(f ntaaa + [ en?(@)d) 
2 4 p+@q —q 
= Et BGK) = 
> pea oe) Fal 2q oe 2q 
p+4q -q 
a og yee a “) 
= (K+ 0) - (9 + EW - (p - OK 
~ J/p+q antag ae 
+(pt+q)E(sn(u)) — (p — q)u) 
= ek pi vert (E(k) + E(sn(u), /)). 
[EJ 
In our case, p = 1 + €* and q = 2e, so that 
2Jé 
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Taken together, the above considerations give us 
x(u) = -—a(i-—e)[F(sn(u, 4), k) + K(k)] 
—a(i+e)[E(sn(u, k), k) + E(k)] 


where K(k) and E(k) are the complete elliptic integrals of the first and second kind, respec- 
tively. We must also convert y(@) to y(u) by the transformation sing = 1 — 2sn*(u). This 
gives 


aJ(i +e) — 4esn2(u) 


| 4e 
— a(l +e) _ (+ey sn*(u) 


= a(lt+e)/1—k*sn2(u) 


yu) 


= a(i+e)dn(w). 


Revolving the curve (x(u), y(u)) around the x-axis leads to 


Theorem 3.7.7 ((MO03b)). A parametrization for the surface of Delaunay (i.e., an unduloid) S is 
given by 
x(u, v) = (x(u), y(u)cosv, y(u)sin v) 
where 
x(u) = —a(l —e)[F(sn(u, k), 4) + K(k)] 
—a(1+e)[E(sn(u, k), k) + E(k)] 


and 
yu) = a(i+e)dn(u). 


In Maple, we have 


> delau:=[-a*(1-epsilon)*(Elliptick(kk)+ 

EllipticF (JacobiSN(u,kk) ,kk)) - a*(1i+epsilon) * 
(EllipticE(kk)+EllipticE(JacobiSN(u,kk) ,kk)) ,a*(1+epsilon) * 
JacobiDN(u,kk)*cos(v) ,a*(1+epsilon) *JacobiDN(u,kk)*sin(v)]; 


delau := [—a (1 — €) (EllipticK(kk) + EllipticF(JacobiSN(u, kk), kk)) 
—a (1 + €)(EllipticE(zA) + EllipticE(JacobiSN(u, kk), kk)), 
a(1+e)JacobiDN(u, kk)cos(v), a(1 + €)JacobiDN(u, kk) sin(v)] 


We can plot unduloids (see Figure 3.11 and Figure 3.12) by writing a Maple procedure with 
inputs the parameters “a” and “ec” in the parametrization given above. First we need the Maple 
plots package. 


> with(plots): 
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Figure 3.11. An unduloid with « = 0,7 Figure 3,12 An unduloid with « = 0.3 


Note. Going from Maple 9 to Maple 10, an error was introduced into the procedure for Elliptic 
E. In order to correct this, give the command 


> ‘evalf/Elliptic/E11_E‘ :=parse(StringTools:-Substitute 
(convert (eval ('evalf/Elliptic/E11_E‘) ,string) ,"F_0","E_0")): 


Maple also seems to have changed its “simplify” command to its detriment. To partially correct 
this, define the command 


> mysimplify:=a->‘simplify/do‘ (subsindets(a,function, 
x->sqrt(x72)),symbolic): 


and then apply it to any expression you want to simplify further. 


> plotund:=proc(a,eps, theta, phi) 
local kk,uu; 
kk:=2*sqrt (eps) /(1+eps) ; 
uu:=fsolve(JacobiSN(u,kk)=1,u); 
plot3d(subs({u=u(t) ,v=v(t)}. [-a*(1-eps)*(EllipticKk (kk)+ 
EllipticF (JacobiSN(u,kk) ,kk))- a*(1+eps)*(EllipticE(kk)+ 
EllipticE(JacobiSN(u,kk) ,kk)) ,a*(1+eps)*JacobiDN(u,kk)* 
cos(v) ,a*(1+eps) *JacobiDN(u,kk)*sin(v)]),u=-uu, .uu,v=0,,2*Pi, 
scaling=constrained, axes=boxed, orientation=[theta, phi], 
shading=XY , lightmodel=light3) ; 
end: 

plotund(1,0.7,-79,70); 


> plotund(1,0.3,-65,70); 


The metric coefficients of the unduloid may be calculated by Maple using the EFG procedure 
of Section 2.5. 
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> EFG(subs(kk=2*sqrt (epsilon)/(itepsilon), delau)); 


2 
(-» + 4¢ JacobiSN («. 2 =) dy Diets ) a 
lt+e 


_4 
14+2e+¢? 


0, 


2: 
= (-# + senscobisn («.2%) -26-1) a 
l+e 


Using the relationships among the elliptic functions, we can simplify the results to 


E=4a' dru), F=0, G=a(l+ey dn?(u). 


Now let’s use Maple to calculate the coefficients of the second fundamental form and the mean 
curvature. We write the results of the procedure “Imn” of Section 2.5 in standard form because 
the Maple output is difficult to typeset. 


> Imn(subs(kk=2*sqrt (epsilon)/(1t+tepsilon), delau)); 
2 
4a dn[u, 2 ve cn(u, Ve e | 2sn(u, ple —l-e 
l+e l+e l+e 
14+2¢€+e¢? —4esn(u, 2 ve ) / 


2 2 
((-1-26-@ ssean(u2 ve ) fim (u 2) (+9) 
l+e l+e 


0 
2 
—cn 2 VE dn pee: a 2esn(u, 2X) 
l+e l+e l+e 
2 
+2esn(u2 “) -2e-2-1) / 
l+e 
2 2 
(f) +26 +2~4ean(«, 275) frasn(«.2 “)) 
lt+e lt+e 


These formulas (with k = 2./e/(1 + €) understood) reduce to 


as sn()?) : 
€ 


a aitily meme 0 (+e) (1 
= = 5 may, A=@ € = 
l+e l+e s 1+ 
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From these and the metric coefficients, we can easily compute the mean curvature to be H = 
1/(2a). We verify this using Maple. First, let’s write a procedure for mean curvature. (Note that 
this requires the EFG and Imn procedures of Chapter 2, but we shall repeat them in Section 3.8.) 


> MK := proc(X) 

local E,F,G,1,m,n,8S,T; 

S := EFG(X); T := wane 

E := $[1];F := S[2]; := $[3]; 

Love TELS = T(2]; = T[3]; 

sinplityCleniseen DaFem) / (24E4G- 2*F°2) ,sqrt,trig,symbolic) ; 
end: 


Maple then gives the following verification. 


> MK(subs(kk=2*sqrt(epsilon)/(1+epsilon), delau)); 


(1 +e) 00biCN(1 2 Y=) JacobiDN u, 2 ve 
l l+e, [+e 
2 - \2 2 
1 — JacobiSN{ «. 2 we: l4+2e+¢2 — 4 JacobiSN(u. 2 ve a 
Il+e l+e 


With a little work, this reduces to H = 1/(2a) and verifies that, in fact, unduloids (as all Delaunay 
surfaces) have constant mean curvature. The approach to the unduloid just given appears in 
[MO03b]. 

Now let’s consider surfaces of Delaunay which are solutions to the following equation (where 
we change the sign on b+). These solutions give rise to nodoids (see [For60)). 


3 2ay 2 


Ve ee i 


Vil+ a? 
yo 4+ 2ay cos(w) = be 


Here we let y’ = tan(y). Let b = a tan(@) (so that 2@ is the angle between the asymptotes of the 
hyperbola generating the nodoid by rolling on a line). Then we obtain the following by applying 
the quadratic formula to the equation y* + 2ay cos(w) — a? tan?(@) = 0. 


y=—acos(w)t ay/costw) + tan?(@) 


—acos(w)+a sec(0) //cos?(0) cos?(w) + sin2(@) 


—acos(w)ta sec(6),/cos2( )( I — sin*(y¥)) + 1 — cos2() 


=-acos(Ww)ta sec(@)V/I — cos?(@) sin2(y) 


= —a cos(v) + a sec(@)yv 1 — cos?(@) sin2(y). 


Then we get 


ay =asin(y) + asec(@) ssn) : 
dy V1 —cos?(@) sin2(w) 
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Now, dy/dx = tan(y), so we get 


a = acos(y) — asec(@) 


cos?(6) _cos*(@)cos*(yr) 
/1 — cos?(6) sin2(W) — cos?(@) sin2(w) 


= acos(y) — asec(@) 


V1 — cos?(6) sin2(v) 


(=e at = aoe) 


cos?(6)(1 — sin2(y)) ee 


= ae 6 
a cos(w) a sec( ) Ji _ cos2() sin?(y) 
a —sin’(6) + (1 — cos*(9) sin?(y)) + dl = cos?(6) sin’?(y)) 
l= cox6sin(y) — cos?(9) sin*(y) 
— sin asin’) 
1 — cos?(6) sin2(y) 
—(1 — Kk’) é 
Ji sin) 
where cos(9) = k (i.e., the elliptic modulus). Now by Remark 3.7.2, if we integrate from 0 to y 


(where y = arcsin(sn(u, k))), then we obtain a parametric formula for x in terms of a parameter 
u, where F(w,k) =u. 


= acos(y) — asec(@) 


= acos() — a sec(@) + /1 — cos?(6) a) 


=acos(w) — ; ( 1—k sm) 


x(u) = asn(u,k)+— (a — k*)u — E(u,k)). 
Of course, we also have an elliptic expression for y (where we use k* sn(u, k)? + dn(u, k)? = 1); 
y(u) = —acn(u, k)+ : dn(u, k). 


These are then the parametric equations for the profile curve (i.e., the nodary) of the nodoid. For 
a Maple procedure which plots all surfaces of Delaunay, see [Opr00]. 


Exercise 3.7.8. Suppose M: x(u, v) = (g(u), A(u) cos(v), (uv) sin(v)) is a surface of revolution 
with constant Gauss curvature K = 1. 


¢ Assume the profile curve (g(u), A(u), 0) is unit speed and show that this leads to the 
differential equation h” = —h (see Exercise 3.3.8). 


e Show hA(u) = Acos(u) + B sin(u). 
e Show 


au) = f vi — (—Acos(u) + B sin(u))? du . 


e Show that we can write g(u) as 


g(u) = / JVl—c?*sin2(u+ ¢) du 
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for c = / A? + B? and tan(¢@) = —A/B. Shift g(u) so that 


g(u) = 1 — c* sin2(u) du. 


Now see the procedure “plotposconstcurv” in Section 3.8. 


¢ For A = 0, show that the parametrization for M is 
x(u, v) = (E(u, B), B sin(u) cos(v), B sin(u) sin(v)), 


where E(u, B) is the elliptic integral of the second kind with modulus B. 


* Let B = 1/2 and plot M using the Maple commands which follow. Compare with the 
numerical Maple procedure “plotposconstcurv” of the next section. 


> ccposR:=<EllipticE(-u,1/sqrt(2)) | 
-1/sqrt(2)*sin(u) *cos(v) |-1/sqrt (2) *sin(u)*sin(v)>; 

> ccposL:=<EllipticE(u,1/sqrt(2))+ 2*EllipticE(1/sqrt(2)) | 
1/sqrt (2) *sin(u)*cos(v) |1/sqrt(2)*sin(u) *sin(v)>; 

> ccposi:=plot3d(ccposR,u=-1..0,v=0..2*Pi): 

> ccpos2:=plot3d(ccposL,u=-1..0,v=0..2*Pi): 

> display ({ccpos1,ccpos2},scaling= 

constrained, shading=z, lightmodel=light2, orientation=[88,60]); 


3.8 Calculating Curvature with Maple 


As might be expected, Maple may be used to calculate Gauss and mean curvatures as well as such 
necessary ingredients as the metric coefficients E, F and G, the quantities (sometimes referred to 
as the coefficients of the second fundamental form) /, m and n and unit normals. The procedures 
which carry out these calculations are listed below. Of course, for plotting surfaces and doing 
vector operations, we should always start with 


> with(plots):with(LinearAlgebra): 


Here are all of the basic procedures required to calculate Gauss and mean curvatures. The 
procedures are generally self-explanatory. The input is always a parametrization X and the 
procedure then carries out Maple calculus and linear algebra commands to compute the relevant 
differential geometric quantity. 


> EFG := proc(X) 

local Xu,Xv,E,F,G; 

Xu := <diff(X[1],u) ,diff(X[2],u) ,diff(X[3],u)>; 
Xv := <diff(X[1],v) ,diff(X(2],v) ,diff(X[3],v)>; 
DotProduct (Xu, Xu, conjugate=false) ; 
DotProduct (Xu, Xv, conjugate=false) ; 
DotProduct (Xv ,Xv,conjugate=false) ; 


aq PI 
Holl 
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simplify([E,F,G]); 

end: 

> UN := proc(X) 

local Xu,Xv,Z,s; 

Xu := <diff(X[1],u) ,diff(X[2] ,u) ,diff(X[3],u)>; 

Xv := <diff(X[1],v) ,diff(x[2],v) ,diff(X[3],v)>; 

Z := CrossProduct (Xu, Xv); 

s:=VectorNorm(Z,Euclidean, conjugate=false) ; 

simplify(<Z[1]/s{Z[2]/s|Z[3]/s>,sqrt,trig,symbolic) ; 

end: 

> lmn := proc(X) 

local Xu,Xv,Xuu,Xuv,Xvv,U,1,m,n; 

Xu := <diff(X[1],u) ,diff(X[2] ,u) ,diff(X[3] ,u)>; 

:= <diff(X[1],v) ,diff(X[2] ,v) ,diff(X[3] ,v)>; 

Xuu := <diff(Xu[1t],w) ,diff(Xu[2] ,u) ,diff(Xu[3] ,u)>; 
:= <diff(Xu[1] ,v) ,diff (Xu[2] ,v) ,diff(Xu[3] ,v)>; 


os 
< 
" 


ms 

fo 

< 
u 


:= <diff(Xv[1] ,v) ,diff (Xv[2] ,v) ,diff(Xv[3] ,v)>; 
U := UN(X); 

1 := DotProduct (U,Xuu, conjugate=false) ; 

m := DotProduct (U,Xuv,conjugate=false) ; 

n := DotProduct(U,Xvv,conjugate=false) ; 
simplify([1,m,n] ,sqrt,trig,symbolic) ; 

end: 


os 

< 

< 
W 


Finally we can calculate Gauss curvature K and mean curvature H as follows. 


> GK := proc(X) 

local E,F,G,1,m,n,S,T; 

S$ := oe T= mea 

E := S[i]; = §[2]; = §[3]; 

1 := T{i];m := Ts n := T(3); 

sinplity (cane m*2)/(E*G-F*2) ,sqrt,trig,symbolic) ; 
end: 


> MK := proc(X) 
local E,F,G,1,m,n,8,T; 


S$ := oe. := re 
E := S[1]; = §[2]; = S[3]; 
1 := T[i]; m := T[2]; n := T[3]; 


simplify ((G*1+E*n-2*F*m) /(2*E*G-2*F°2) ,sqrt,trig,symbolic) ; 
end: 


Here are some examples. We consider a sphere, a torus, a catenoid and an elliptic paraboloid. 


> sph:=<R*cos(u)*cos(v) |R*sin(u)*cos(v) |R*sin(v)>; 
sph := [Rcos(u) cos(v), R sin(u)cos(v), R sin(v)] 
> EFG(sph) ; 
[R? cos(v)*, 0, R?] 
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> I1mn(sph) ; 
[—Rcos(v)*, 0, —R] 
> GK(sph) ; 
1 
R2 
> MK(sph) ; 
1 
R 


> tori:=<(Rtr*cos(u))*cos(v)| (Rtr*cos(u))*sin(v) [r*sin(u)>; 


torl := [(R +rcos(u))cos(v), (R + rcos(u)) sin(v), r sin(u)] 


> GK(tor1); 
cos(u) 
r(R+rcos(u)) 
> MK(tor1); 
1 R+2rcos(u) 


2 r(R+rcos(u)) 
> catenoid:=<cosh(u)*cos(v) |cosh(u)*sin(v) |u>; 


catenoid := [cosh(u) cos(v), cosh(u) sin(v), u] 


> GK(catenoid) ;MK(catenoid) ; 
l 
cosh(u)* 


0 


Notice that the mean curvature computation verifies that the catenoid is a minimal surface. Now 
let’s look at the elliptic paraboloid. Some simple algebraic manipulation is necessary to make 
Maple’s calculation jibe with that of Exercise 3.2.13. 


> ellpar:=<ulv|u72/a72+v~2/b*2>; 


a 
ellpar := [ eae & 
a 
> GK(ellpar) ;MK(ellpar) ; 
4b° a® 
(a* b4 + 4u? b4 + 402 at)? 


(a2 b+ +40? v2 +b? at*4+ 4b u?)a? b? 
(a4 b4 + 4u2 bt + 40? at)G/) 
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Exercise 3.8.1. Write a Maple procedure to calculate Gauss curvature (when F = 0) from the 
formula 


Barca as Jag) os Je) 
~  2VEG \au\ JEG) au \ VEG) ) © 
Exercise 3.8.2. Calculate Gauss and mean curvatures for the usual list of surfaces: helicoid, 
Enneper and hyperboloid(s). Also calculate for the pretzel surface 


x(u, v) = (cos(u) cos(v), sin(u) cos(v), sin(v) + x). 


It is also possible to get a picture of Gauss curvature on the surface itself by coloring points 
with similar Gauss curvature by the same color. For example, the following colors the helicoid 
according to its Gauss curvature via the option “color=GK (helicoid)”. 


> helicoid:=<u*cos(v) lu*sin(v)|v>; 


> plot3d(helicoid,u=0..1.5,v=0..5*Pi, color=GK(helicoid), 
grid=[10,40]); 


Exercise 3.8.3. Color Enneper’s surface according to its Gauss curvature. What happens if you 
color according to mean curvature? 


Exercise 3.8.4. Define the pair of pants surface by 
x(u, v) = (u, v, In(fu4 + 2u2 v? — 2u2 + v4 +202 + 1)). 


Carry out the following Maple commands to analyze this surface’s curvature properties. 


> pair:=<ulv]1n((u74+2*u72*v* 2-247 Q+v74+2*v"72+1)7(1/2))>; 
> GK(pair); 
> MK(pair); 


> plot3d(pair,u=-1.5..1.5,v=-1.5..1.5,axes=boxed,shading=xy, 
lightmodel=light2, grid=[30,30] ,orientation=[-33,61]); 


> plot3d(GK (pair) ,u=-1.5..1.5,v=-1.5..1.5,axes=boxed) ; 
> plot3d(MK (pair) ,u=-1.5..1.5,v=-1.5..1.5,axes=boxed) ; 


> plot3d(pair,u=-1.5..1.5,v=-1.5..1.5,scaling=constrained, 
color=GK (pair) ,orientation=[83,147] ,grid=[30,30]); 


> plot3d(pair,u=-1.5..1.5,v=-1.5..1.5,scaling=constrained, 
color=MK (pair) ,orientation=[83, 147] , grid=[30,30]); 


As a less computational subject amenable to Maple, consider the surfaces of revolution of 
constant Gauss curvature. In Exercise 3.7.8, the integral formula for g(u) was determined when 
h(u) satisfies h” = —h. Of course, the answer is that h(u) = A cos(u) + B sin(u) and 


g(u) = a V1 — (—Asin(t) + B cos(r))* dt. 
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Figure 3.13. A constant K = 41 surfaces | Figure 3.14 A constant K = +1 surface: 0.65 


Figure 3.15. A constant K = +1 surface. 1.7 


We may then take g(w) = { \/ 1 - c? sin¢() dw. Notice that the last expression will only make 
sense when the square root is defined. Here is a Maple procedure which graphs surfaces of 
constant positive Gauss curvature (see Figure 3.13, Figure 3.14 and Figure 3.15).The key to the 
procedure is using Maple’s “dsolve” command to solve the differential equation corresponding to 
the integral for g(w) given above, The solution is numerical of course and Maple denotes this by 
the option “type=numeric”™. The option “output=listprocedure”™ allows the values of the numerical 
solution to be assigned to a variable “gu” and then plotted. The single quotes ‘gu’ in the “plot3d” 
command avoid what Maple experts call “premature evaluation” of the numeric solution values. 


> plotposconstcurv:=proc(c,theta, phi) 

local X,desys,gu,s; 

if 1/c>i then s:=-Pi/2..Pi/2 else s:=-arcsin(1/c).. 
arcsin(1/c) fi; 

X:=[c*cos(u) *cos(v) ,c*cos(u)*sin(v),g]; 

desys :=dsolve({diff(g(u) ,u)=sqrt(1-c"2*sin(u)~2) ,g(0)=0}, 
g(u),type=numeric,output=listprocedure) ; 
gu:=subs(desys,g(u)); 

plot3d (subs (g='gu’ (u) ,X) ,u=s,v=0. .2*Pi,scaling=constrained, 
style=patch, shading=zhue, orientation=[theta, phi] ); 

end: 
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TTT 
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Figure 3.16. A constant K = —} surface: 1,0 Figure 3.17 A constant K = —1 surface: 0.3,1 


\AAUUUE 


\ivuge eel, 
Naa! 


Figure 3.18. A constant K = —1 surface: 0.6,0.1 


> plotposconstcurv(1,49,78) ; 
> plotposconstcurv(0.65,49,78) ; 
> plotposconstcurv(1.7,49,69); 


Exercise 3.8.5. Analyze the following Maple procedure to draw constant negative curvature 
surfaces of revolution (see Figure 3.16, Figure 3.17 and Figure 3.18), Explain each line of the 
procedure. 


> plotnegconstcurv:=proc(c,d,theta, phi) 
local X,desys,gu,s,h,kk,init; 

if d=0 then h:=c*exp(u) ; 

$s:=-2.5..In(1/c); 

init:=-1 fi; 

if c>O and d>O then h:=2*sqrt(c*d)*cosh(u) ; 
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s:=arcsinh(-1/(2*sqrt(c*d)))..arcsinh(1/(2*sqrt(c*d))); 
init:=0 fi; 

if c>=0 and d<O then h:=2*sqrt(-c*d)*sinh(u) ; 

kk: =c*d; 

if kk>=-1/4 then s:=0..arccosh(1/(2*sqrt(-c*d)))fi; 

init:=0 fi; 

X:=[h*cos(v) ,h*sin(v) ,g]; 
desys:=dsolve({diff(g(u) ,u)=sqrt (1-diff(bh,u)*2) ,g(init)=0}, 
g(u) ,type=numeric, output=listprocedure) : 
gu:=subs(desys,g(u)); 

plot3d(subs(g=’ gu’ (u) ,X) ,u=s,v=0. .2*Pi,scaling=constrained, 
style=patch, shading=zhue, grid=[15,40] ,orientation= 
[theta,phi]); 

end: 

> plotnegconstcurv(1,0,41,64) ; 

> plotnegconstcurv(0.3,1,47,64); 


> plotnegconstcurv(0.6,0.1,49,69) ; 


To end this section on Maple, we would like to carry out the procedure for finding surfaces of 
revolution with K = —1 using Maple. This will serve as a bit of a warning concerning reliance 
on computer algebra systems without accompanying understanding. Of course, we need the 
following as well as the procedure for Gauss curvature “GK” and all of the procedures required 
for that. 


> with(plots) :with(LinearAlgebra): 


Now we can enter a surface of revolution with parametrization x(u, v) = (h(u)cos(v), h(w) 
sin(v), g(u)). 


> surfrev:=[h(u)*cos(v) ,h(u)*sin(v) ,g(u)]; 


surfrev := [h(u) cos(v), h(w) sin(v), g(u)] 


We can take the Gauss curvature using the GK procedure above. 


> gauss:=simplify(GK(surfrev)) ; 


(-(4 hw) (4 guy) + (4S au) (4 be) (£ gw) 
(£ hw)? + (4 g(u))?)? h(uw) 


gauss := 


Of course we may assume the profile curve a = (h(u), 0, g(u)) is unit speed. This means that 
(£ gu)’ + (£ h(u)y = 1. If we differentiate this expression on both sides, we get 


> diff (diff(g(u) ,u) *2+diff(h(u) ,u)*2,u) =diff(i,u); 


d a d d? 
2 (= hi) (= hw) +2 (+ a) (a a) = 0 
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We can now simplify the Gauss curvature of the surface of revolution using the two relations we 
have just found. The way to do this is 


> gaussK:=simplify(gauss, {diff(g(u) ,u) *diff (diff (g(u) ,u) ,u) 
= ~diff(h(u) ,u) *diff(diff(h(u),u),u), diff(g(u) ,u) 72+ 
diff (h(u) ,u)*2=1}); 


az h(u) 
aussK := — 2° 
h(u) 
So now we have an expression for the Gauss curvature (assuming the profile curve is unit speed). 
We want K = —1, so we solve the differential equation obtained when we set the curvature equal 
to—1. 


> h_ans:=dsolve(gaussK=-1,h(u)); 
h_ans := h(u) = Cle“ + _C2e& 


Let’s put in some initial conditions to make the solution h_ans explicit. 


> hh:=rhs(eval(subs({_C1=1,_C2=0},h_ans))); 
hh := e" 


Now that we have h(u) = e”, we get g(u) by using the unit speed condition (4 g(u))? + 
(Zhu)? = 1. 


> g_ans:=dsolve(diff(g(u) ,u)*2+diff(bh,u)°2 =1,g(u)) [1]; 


g_ans := g(u) = V1 — (e“)* — arctanh ( 


1 
> ] 4+ Cl 
/{ — (e# Ne 
This appears to be a bug in Maple’s integration procedure. The same answer is obtained by 
computing the equivalent integral. 


> int (sqrt (1-exp(u)*2) ,u); 


1 
Vv 1- (e")? — arctanh (=m) 


This is incorrect. So computer algebra systems are not perfect! The hyperbolic tangent tanh is 
always less than 1 in absolute value, but 1/,/1 — (e“)? is always greater than 1 when it is defined. 
Algebraically, the answer above works, but it does not take into consideration that the square 
root can be defined only when 0 < | — (e“)*. The old MapleV4 (or doing the integral by hand!) 
gives the following correct answer. We then use it as g_ans and we substitute in a value for the 
integration constant. 


> (1-exp(2*u))*(1/2)- arctanh((1-exp(2*u))*(1/2)); 


v¥1—e@ — arctanh(v 1 — e24)) 
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Figure 3.19. K = —1 surface of revolution with h(u) = e” 


g_ans:=(1-exp(2*u))~(1/2)- arctanh((1-exp(2*u))~(1/2))+_C1; 
guans := V1 — e@”) — arctanh( V1 —e@) 4+ Cl] 


> gg:=subs(_Ci=0,g_ans) ; 
gg = V1 — e@") — arctanh(y 1 — e?")) 


We can now plot the surface of revolution by substituting our functions into the surfrev parame- 


trization (see Figure 3.19). 


> subs({g(u)=gg,h(u)=hh},surfrev) ; 


[e" cos(v), e“ sin(v), v1 — e?@”) — arctanh(y 1 — e2@"))] 


plot3d(subs({g(u)=gg,h(u)=hh },surfrev) ,u=-2..0,v=0..2*Pi, 
scaling=constrained , shading=zhue, lightmodel=light3, 
orientation=[36,80]); 


Let’s try another set of initial conditions, 
hh2:=rhs (eval (subs({_Ci=1,_C2=1/4}, h_ans))); 
] 
hh2 :=e" + —e-") 
1 aia ae 


Now at we have h(uw) = e“ + ia: we get g(u) by using the unit speed condition (4 piu) + 
(4 h(w)P =I. 

- g_ans2:=rhs(dsolve(diff(g(u) ,u)“2+ diff (hh2,u)~2=1, 

g(u)) [1)); 


| (4? +4e"—l Ge" —4e"—1 
g-ans2 ‘= [; FE See —NO Pte of 17 


> gg2:=subs(_C1=0,g_ans2) ; 


e" 
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Figure 3,20. Square roots make sense in (— 1.574520768, 0.1882264063) 


We can now plot the surface of revolution (see Figure 3.21) by substituting our functions into 
the surfrev parametrization once we know limits of integration for gg2. First we check when the 
square roots have negative numbers in them (see Figure 3.20). 


> evalf(solve(-(4*exp(u)~2+4*exp(u)-1) * (4*exp(u)~2- 
4+exp(u)-1)=0)); 


0.1882264063, — 1.574520768 + 3.141592654 /, 
— 1.574520768, 0.1882264063 + 3.141592654 / 


> plot(-(4*exp(u)“2+4*exp(u)-1) * (4*exp(u)~2-4*exp(u)-1), 
u=-1.57..0.188) ; 


Figure 3.21. K = —1 surface of revolution for A(u) = e” + t e" 
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This means we can define gg3 using gg2 and the limits defined by the interval 
(—1.574520768, 0.1882264063). 


> gg3:=Int(-1/4*sqrt (-(4*exp(s)72+ 4*exp(s)-1)*(4*exp(s)72- 
4xexp(s)-1))/exp(s) ,s=-1.5..u); 


"1 J-(4(e ype t4e — D(4(e —4e = 1 
a= f of Merete NCCP Ae aT, 
aus 4 e 


> plot3d(subs({g(u)=gg3,h(u)=hh2 },surfrev) ,u=-1.5..0.18, 
v=0..2*Pi,style=patch,scaling=constrained, shading=zhue, 
lightmodel=light3, orientation=([36,80]); 


Constant Mean Curvature Surfaces 


4.1 Introduction 


Much of science has to do with determining mathematical principles which underlie the phenom- 
ena of our universe. Although Kant seems to have overstated the case in declaring that humans are 
a priori Euclidean geometers, there can be no denying the proliferation of physical theories based 
on geometry. From Newton, who gave only geometric arguments in the Principia, to Einstein, 
who formulated “gravitation” as the geometry of space-time, to the present-day creators of Grand 
Unified Field Theories, the underlying mechanisms of the universe are geometric in nature. 

Of course, in saying that the great forces of nature are consequences of some geometric 
structure, we lose sight of the fact that phenomena which we observe every day often display the 
same geometric dependence. Furthermore, we have already seen that nuclear cooling towers take 
the shapes of hyperboloids of one sheet in order to prevent unnecessary stresses, so it should not 
be surprising that other types of construction might be similarly geometrically affected. Indeed, 
the Frei Otto school of architecture makes extensive use of minimal surfaces in its designs. 

In this chapter we shall give a brief introduction to minimal surfaces and surfaces of con- 
stant mean curvature. While these surfaces are intrinsically mathematically interesting, they 
also are objects of study and application in the sciences. Basic references for this chapter are 
[Opr00, Oss86, Nit89, DHK W92, FT91]. 

Surface tension is that quality responsible for the cohesiveness of liquids. Think of a liquid 
as a collection of (polar) molecules, each of which exerts attractive forces of equal strength on 
the others. Deep inside the liquid a molecule feels equal forces from all directions, but near the 
surface of the liquid a molecule feels more of a force from inside the liquid than it does from the 
small number of molecules between it and the surface. Hence, those molecules near the surface 
are drawn into the liquid and the surface of the liquid displays a “curvature”. Thus we see liquids 
form into drops or bubbles because surface tension acts as a skin holding the liquid together. Buta 
soap bubble doesn’t shrink away to nothing, so at some point surface tension must be balanced by 
an internal pressure. In the case of the soap bubble, of course, the atmospheric pressure inside the 
bubble is larger than that outside, so an eventual equilibrium is attained. Indeed, this war between 
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x+0x 


Figure 4.1. Work done on a surface 


surface tension and pressure is brought to a cease-fire by the mediating effect of geometry (in the 
form of curvature), 

Around 1800, Thomas Young (who is best known for his diffraction experiments establishing 
the wave nature of light) and Pierre Simon Laplace analyzed the phenomenon of surface tension 
in the following way. Consider Figure 4.1, which we take to be an “infinitesimal” piece of surface 
area expanded outward by an increased pressure. Note that, since the piece is very tiny, we may 
assume that the front and side curves are pieces of circles of radii R, and R2 respectively. 

Let’s compute the work done to expand the area shown (which we denote by 5). Remember 
that work is force x distance and pressure is force per unit area. We then get from Figure 4.1, 


W=F-D 
=p-S-éu 
=p-xy-du. 


But we can compute work in another way. Ask yourself, how could a soap film do work? Try 
this. Take a wire in the shape of a horseshoe and attach (loosely) a string to the two ends by 
loops. Hold the loops and dip the whole object in a soap solution. Now let the loops go — what 
happens? The string is pulled to the top of the wire by the film! This means that the film has the 
potential to do work. Moreover, the quality responsible for contracting the string and the film 
itself is surface tension. Also, from this description we see that the amount of work done must 
be equal to the surface tension T (the units of which are force per unit length) multiplied by the 
change in surface area AS. Of course, 


AS = (x + 6x)y+éy)—xy 


from the picture (approximating the areas appropriately). Now, the length x is part of a circle 
of radius R; and x + dx is part of a circle of radius R; + du with the same included angle. 
Therefore, we have the equality 


x+dx x 


Ri tou Ry 
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Treating y similarly gives 


éu 
xtorax(1+ 2) 


éu 
éy= 1+—}. 
ytoy r(i+ 2) 


Plug these into AS to obtain, 


AS i Wr ar ree 
=x —y —})—-xy 
R y Ro y 
1 l (6u)* 
= 5 —< . 
cas “(ze : a) +8) ee 


If du is small, we may neglect the last term. Now put this expression for AS into the work formula 
W = TAS and equate it to our previous work calculation. We end up with, 


| | 
pxydu=Tx you (= + =) 


R2 
1 I 
ene (5 : i) 

This is the Laplace- Young equation that expresses a fundamental relationship among surface 
tension, curvature and pressure in the context of thin films. The equation has important applica- 
tions, even in areas such as medicine. For instance, alveoli in the lungs obey the Laplace- Young 
equation and this means that, when pressure decreases upon expiration, in order to prevent col- 
lapsing of the airways, it is necessary for surface tension to decrease —~ and this is the function 
of the chemical surfactant found in the lungs (see [BGH77, p. 199]). 

The Laplace- Young equation says that a pressure difference on either side of a bubble or film 
is given by the product of the surface tension of the bubble or film and a quantity which is clearly 
related to the shape of the bubble or film. In fact, notice that the only requirement (implicitly) 
placed on our sntinitgsim piece ar area was that the curves meet at right angles (so S ~ xy). Also, 
notice that the quantities j- and 4 = are the normal curvatures of the surface in those perpendicular 
directions. But, then (by Exercise 3.1.6) we recognize the quantity 2H = 1/R, + 1/R2 as twice 
the mean curvature of the surface. 

Now consider the case of a soap film bounded by a wire say. Since there is no enclosed volume, 
the pressure is the same on both sides of the film. Hence, p = 0 and (since T is constant) H = 0 
as well. Therefore we see that 


Theorem 4.1.1. Every soap film is a physical model of a minimal surface. 


It is clear that surface tension tries to make the surface as “taut” as possible. That is, the surface 
should have the least surface area among surfaces satisfying certain constraints like having fixed 
boundaries or fixed volumes. In fact, this area-minimizing property implies the curvature property 
above and gives rise, in the case of films, to the term minimal. Let’s see how this works. 
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4.2 First Notions in Minimal Surfaces 


To begin, let’s understand (in a somewhat informal way) how to integrate over a surface. Of 
course the simplest surface is the plane and here we do ordinary double integration. Our whole 
philosophy in dealing with surfaces has been to do all calculus computations in the plane and 
then transport them to a given surface via a patch x(u, v). Therefore, it’s no surprise that we do 
the same thing for this situation. In Chapter | we noted that the length of the cross product of 
two vectors equals the area of the parallelogram which they span. Now, what is the area of a tiny 
parallelogram-like piece of a surface? Since it is very small, we may think of the parallelogram 
as spanned by tangent vectors x, and x,. Hence, the area is given by |x, x x,|. This quantity 
then approximates the area of that tiny piece of the surface. Of course this is the usual modus 
operandi of calculus; approximate a small piece and then add up continuously (i.e., integrate) 
over the entire region. So, let x(u, v) be a patch for a surface M. Then we make the 


Definition 4.2.1. The area of the patch x, denoted Ax, is 


Ay = a |X, X X,| dudv, 


where the limits of integration are the defining limits of the patch. 


Furthermore, although we will not prove it, any compact oriented surface may be cut up into 
a finite number of patches x (which, in fact may be taken to be oriented triangles) meeting only 
along their boundary curves with the opposite orientation. Therefore, we may calculate the whole 
surface area of a compact oriented surface M as the sum of the areas of the individual patches 


A=D ff ix. xxl auede. 


Note also that if we miss a few points, this does not affect the integral. So, all of the patches we 
have considered up to this point allow surface area to be calculated in a single integration. We 
shall return to this later and give some exercises, but right now our only aim is to use this idea to 
understand something about minimal surfaces. 


Figure 4.2. Oriented triangulation of a patch 
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Let’s begin our analysis of minimal surfaces. Let < = f(x, vy) be a function of two variables 
and take a Monge patch for its graph: x(u, v) = (u, v, f(u, v)). From Exercise 3.2.26 we have 


Xy = (1. 0, i) Xu = (0, 0. Suu) 
x, = (0,1. f,) Xuv = (0,0. fr) 
Xe = (0,0. fre) 


=fis Ful 
Xy X Xp = (— fue fe. 1) ee oe Re) 
Vl4+ f2+F 
E=1+f? P= fife Gain7 
é = Tu m a iii n= tov 


Woe ce oe Vl+ fR +h? (lt fH 


a Tau Tor a fe 
Oe rie At 


H= (1 + Feaner + ae ae 7S 2 fate fuv 
i+ f2 + f2)3 


The expression for H immediately gives 


Proposition 4.2.2. M is minimal if and only if 


full + £2)— 2 fu fo fue + fowl + £2) = 0. 


This partial differential equation is called the minimal surface equation. By placing algebraic or 
geometric constraints on the function {, we determine various types of minimal surfaces. 


Exercise+ 4.2.3. Suppose we require the algebraic condition that f(x, y) = g(x) + A(y). Show 
that H = Oimplies (1 + h’?(y))e"(x) +(14+ ge (x))h"(y) = 0. Separate variables and solve (see 


Subsection 4.9.4). Infer that 
; 1 cos ax 
f(x,y) = —-In : 
a cosay 


The surface z = f(x, y) is called Scherk s minimal surface. See Figure 4.3. Note that Scherk’s 
surface is only defined for cosax/cosay > 0. For example, a piece of Scherk’s surface is given 
over the square —> <ax < 5,—} <ay < 5. To see how different pieces of Scherk’s surface 
fit together, see [Opr00]. Surprisingly, only the catenoid and helicoid were known to be minimal 
in the 1700s. Scherk’s surface was the next example of a minimal surface and it was discovered 
by Scherk in 1835. ft is a wonder that the simple algebraic condition above was not considered 


earlier. 


Exercise 4.2.4. Suppose we require the geometric condition that every level curve f(x, y) =c 
be a line. What types of minimal surfaces do we now obtain? Hints: 
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Figure 4.3. Scherk’s surface 


® In general, if.a curve is given implicitly as f(x, y) = c, then the curvature is given by 


re =f} + 2h fy fry ra toft 
IvyeP 


Show this by invoking the implicit function theorem and writing f(x, g(x)) = c with para- 
metrization a(x) = (x, g(x), c). Then 


=~ tel 
k= Re Te t. 
37 


Show g’ = -4 by using Vf - a’ = 0. Now find g” by implicit differentiation. 


3 
* Show H =04 fut fy =Kl AP t+ fl. 
* Now « = 0 (why?), so fix + fyy = 0. 
* Show that a solution to Laplace’s equation above with the given geometric constraint is 


f(x. y) = A arctan (=) +B. 
a — X09 
If we let x — x9 = ucosv, y — yo =u sinv, z — B = Av, then we see that the surface must 
be a plane or a helicoid. In fact, the proof that f(x, y) is the only such solution requires 
much harder analysis and is known as Hamel’s Theorem (see [Gra40}]). 


Exercise 4.2.5. A unit speed curve on a surface a: | — M is said to be an asymptotic curve 
if a” -U =0 for each point on the curve. By our formula a” - U = S(a’) - a’ = k(a’), we see 
that asymptotic curves are those curves which travel in directions of zero normal curvature. 
Show that a surface M is minimal if and only if through each point there are two orthogonal 
asymptotic curves. Verify that the parameter curves for the helicoid x(u, v) = (u cos v, u sin v, v) 
are asymptotic and orthogonal at each point. Hint: use Euler’s formula for normal curvature and 
H=0. 
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We have seen in the last chapter that a non-planar minimal surface of revolution must be a 
catenoid. Naturally, we can ask similar questions by imposing other geometric hypotheses. Later 
we shall consider the question of whether a minimal surface can also possess constant Gauss 
curvature. Here, we confine ourselves to the following 


Theorem 4.2.6 (Catalan’s Theorem). Any ruled minimal surface in R? is part of a plane or a 
helicoid. 


Proof. ([BC86]) Let M: x(u, v) = B(u) + vd(u) where, without loss of generality, we make the 
following assumptions: (1) by Exercise 2.1.26, we may take 6 perpendicular to the rulings of 
M (ie., B’ - 6 = 0) with unit speed (i.e., 6’ - B’ = 1). (2) 6 is a unit vector field of directions for 
the rulings along £ (i.e., 5 -5 = 1). Hence, we also have 5 - 5’ = 0. From these assumptions we 
derive the data, 


xX, = B’ + v6’ X=6 x, xXx, =f xb+06' x6 
E=1+42vp’-8'+v7Js' F=0 G=1 
Xuy = B” + v6" Xuyv = 5 Xyy = 0 


_ BY: Bl x 8 + up" +8 x 8 + v8" Bl x 6 + v°8"-8" x8 
7 VE 


Then H = a = 0 since M is minimal, so £ = 0. Now, the numerator of £ is a polynomial 
in v, 


£ =? n=0. 


B’- px d+v[B" 8 x +8": B’ x 8] +078" -8' x 6, 


so for it to be zero, we must have each coefficient zero. Set the first term to zero, B” - B’ x 6 = 0. 
This equation says that 6” is contained in the plane spanned by f’ and 6, denoted (f’, 5). Since 
B is unit speed, ’ and 6” are perpendicular. But by assumption, A’ is perpendicular to 5 as well. 
Since all three vectors are in the plane (f’, 5), it must be true that 6” is parallel to 6. In fact, since 
the length of 6 is | and the length of 6” is the curvature kg of B, we must have B” = +kg6. At 
any rate, we certainly have B” - 6’ x 6 = 0. 

Now look at the coefficient of v set to zero and use 6” - 6’ x 6 = 0 to infer 6”- p’ x6 =0. 
This says that 6” € (8’, 5) also. Now, the coefficient of v2 set to zero is 8” - 5’ x 8 = 0 and this 
implies that 6” € (6’, 5) as well. Hence, 5” € (5’, 5) M (B’, 5). Now there are two possibilities. 

First, we may have that 6” is not parallel to 6 at some point (so in some neighborhood as well). 
This would then entail the equality (6’, 5) = (6’, 5). In this case, a B’ = 5’ since f’, 5’ and 6 are 
in the same plane and both f’ and 6’ are perpendicular to 5. But then the unit normal of M would 
take the form 


_ ( +avy(B' x 8) = 


= ——__—___— = +f’ x §, 
{1 + av| |p’ x 6] 


since |B’ x 5| = 1. Upon taking the derivative of U with respect to u, we have U’ = B” x 6+ 
B’ x 6’ =O since B” == xg5 and B’ = ad’. Hence, U is constant, so M is part of a plane. 
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Second, we may have that 5” is parallel to 5 everywhere. Hence, 5” = a 6 and, consequently, 
p’ 6” = 0. But Bp’ - 6 = 0 implies kg = «gd -5 = B” - 5 = —f’ - 6’ and this, in turn, gives 


d 
phil Te a ae ee 
du 


since 6” and 5” are parallel to 6 and ’ and 6’ are perpendicular to 6. Hence, the curvature xg is 
a constant. Note that if this constant is zero, then is a line and the surface is part of a plane. 
Now consider the torsion tg of B. The usual formula becomes tg = f’ x 5 - &’ since B” = Kg 
and, because « is constant, B’”” = xpd’. 


Exercises 4.2.7. Show that oe = 0. 


Therefore, tg is a constant. Recall that a curve with constant curvature and torsion is a circular 
helix. Up to a rigid motion of R*, we may then parametrize B by 


B(u) = (Acosu, Asinu, Bu) 


with A? + B* = 1. Also, 5 is parallel to 8”, so 6(u) = (cos u, sin u, 0) since 5 has unit speed. Let 
A + v = v. We then obtain a parametrization for the helicoid, x(u, v) = (cosu, dsinu, Bu). 


So, we see that the two most natural geometric conditions to impose along with minimality, 
being a surface of revolution or being ruled, lead to the minimal surfaces of the 18th century, the 
catenoid and the helicoid. 


Exercise 4.2.8. Plot the following surfaces. Show that Helcat, Enneper, Scherk and Planar Lines 
of Curvature are minimal and compute their Gauss curvatures. You may find Maple to be of 
use here, but be careful. If you try Henneberg or Catalan, save your worksheet before actually 
computing! 


¢ Helcat (See Figure 4.4). x(u, v) = (x!(u, v), x?(u, v), x3(u, v)), for any fixed t, where 


x!(u, v) = cos(t)sinh(v) sin(u) + sin(t) cosh(v) cos(u) 
x?(u, v) = —cos(t) sinh(v) cos(u) + sin(t) cosh(v) sin(u) 


x3(u, v) = ucos(t) + vsin(t). 
¢ Henneberg’s Surface (See Figure 4.5). x(u, v) = (x!(u, v), x?(u, v), x3(u, v)), where 
2 
x!(u, v) = 2 sinh(u) cos(v) — 3 sinh(3x) cos(3v) 
2 ; : 28 : 
x*(u, v) = 2 sinh(u) sin(v) + 3 sinh(3x) sin(3v) 
x3(u, v) = 2 cosh(2u) cos(2v). 


© Catalan’s Surface (See Figure 4.6). 


x(u, v) = (u — sin(u) cosh(v), 1 — cos(u) cosh(v), 4 sin(u/2) sinh(v/2)). 
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Figure 4.4. A one-parameter family of minimal surfaces joining the helicoid and the catenoid 


Figure 4.5. Henneberg’s surface 


Figure 4.6. Catalan’s surface 
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Figure 4.7. Enneper’s surface Figure 4.8 Scherk’s fifth surface 


® Ennepers Surface (See Figure 4.7), 
x(u, v) = (u — «/3 + uv’, v— v°/3 + vu?.u? — v’). 


© Scherk’s Fifth Surface (See Figure 4.8). This surface is often written in nonparametric form 
sin z = sinhx sinh y. Parametrically, 


x(u, v) = (arcsinh(w), arcsinh(v), aresin(u)). 
® Planar lines of curvature surface (See Figure 4.9), 
sw. oy (“ t+sinucoshv vteccosusinhu Siac ah v) 
A Su COS : 
vl—c Vv1—c?* 
4.3 Area Minimization 


Now let’s try to understand this word “minimal” a little better. In the mid-1800s the Belgian 
physicist Plateau set the following problem: given a curve C, find a minimal surface M having C 
as boundary, Plateau was interested in thin films (e.g., soap films) and his problem was a natural 


Figure 4.9. Planar lines of curvature surface 


4.3. Area Minimization 71 


outgrowth of his physical expernments. As we shall see below, /eas!-area surfaces are minimal. 
In fact, it was eventually realized that hopes of finding a general method of solution for Plateau’s 
problem rested on producing least-area surfaces. Thus, another version of Plateau’s problem is 
to find a least-area surface having C as boundary. But even the existence of area minimizing 
surfaces is not automatic. In fact, it was only in the 1920s and 1930s that Plateau’s problem was 
first solved by J. Douglas and T. Rado (see [Dou31, Rad71]). They proved 


Theorem 4.3.1. There exists a least-area disk-like minimal surface spanning any given Jordan 
curve. 


(A minimal surface is disk-like if its parameter domain is the unit disk D = ((u. v)|u? + v? < 1} 
and the boundary circle maps to the given Jordan curve.) To show the delicateness of this problem, 
consider 


Exercise 4.3.2, Let C be the unit circle in the xy plane. Show that the following variant of 
Plateau’s problem has no solution: Find the surface of Jeast area which has C as boundary and 
passes through (0,0, 1). Note that this problem begs the question: exactly what is a surface? 
Does a disk with a spike qualify (see Figure 4.10)? This (along with an appropriate definition of 
area) was one of the main difficulties in considering Plateau’s problem. Of course from our very 
“differentiable” point of view, we obviate such considerations. 


Figure 4.10. Disk with spike 


Let’s look at a simpler question. What is a necessary condition that M have least area among 
surfaces with boundary C? The answer may be found through a simplified version of the calculus 
of variations as follows. 

Suppose M: z= f(x, y) is a surface of least area with boundary C. Consider the nearby 
surfaces which look like slightly deformed versions of M, 


M’: z(x,y) = f(x, y) + tex. y). 


Here, g is a function on the domain of / which has the effect. when multiplied by a smail ¢ and 
added to f, of moving points of M a small bit and leaving C fixed. That is, g|¢ = 0, where C is 
the boundary of the domain of f and f(€) = C. A Monge patch for M' is given by 


x'(u, v) = (u,v, f(u, v) + teu, v)). 
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Immediately we compute 


xi, x xi,| =y¥ 1+ te + ihe + 2t( fubu + fv8v) + 17(9,2 al B57): 


By the definition of area, we see that the area of M’ is 


awy= ff (14 B+ B42 hase + fas) + Pe? +e?) dude, 


Now take the derivative with respect to ¢ (which then passes inside the integral), 


a= f Su8u + fo8v + t(gu? + gy’) Woah 
udu J1+ F2 + F2 + 2t( fuBu + fo8v) + t(gu2 + 9,7) 


We assumed z = zo was a minimum, so A’(0) = 0. Therefore, setting t = 0 in the equation above, 


we get 
if Su8ut foBv_ a, dv=0. 
uVlt+ f2+4+ f2 
Now, let 


7 fu8 a fo8 
“Vienee °o fete 


Exercises 4.3.3. Compute 2° and se and apply Green’s Theorem (Theorem 1.6.2). 


We then get 


if Fu8ut foe dudv+ ff SoeO IO ful +L Aha heho ay a 
On aaa fae (1+ f2+ #2): 
= fugdvu  fugedu 
eVitirn \lthth 
since g|¢ = 0. Of course the first integral is zero as well, so we end up with 
[ [SoC Rts fea Bhelehad 
(1+ f2+ #2)? 


Since this is true for all such g (see Exercise 7.1.5 and the following discussion), we must have 


Sau + f2)+ fw PVD ha = 0. 


But this is the minimal surface equation. Therefore, we have shown the following necessary 
condition for a surface to be area minimizing. 


dv =0. 


Theorem 4.3.4. [f M is area minimizing, then M is minimal. 


Exercise 4.3.5. Let M be a graph z = f(x, y) above a domain D C R? with boundary a closed 
curve C. Show that, if f satisfies the minimal surface equation, then M has least area among all 
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graphs of functions z = g(x, y) with g|c = fJc. Hints: (1) a surface Z is formed by adjoining 
the two graphs along g(C) = f(C), (2) take the normal vector field for M defined in all of D x R 
by 


AS =f, x —f, y 1 
f a SO ONS OEE. ? 
A+fRP+R JlIt+ PtP JIt+P+Kh 
(3) show div(U;) = 0, apply the divergence theorem and estimate each surface integral obtained 


by using the fact that U; is a normal vector field for one of the pieces of Z, but not the other! 
(See [Opr00] for a discussion.) 


Exercise 4.3.6. Determine the parameter a for the catenary given by y = a cosh(*) which passes 
through the points (—.6, 1) and (.6, 1). Compute the surface area of the catenoid generated by 
revolving the catenary about the x-axis. Show that this surface area is greater than the surface 
area of two disks of radius 1. Hence, if the original boundary curve consisted of two circles 
perpendicular to the x-axis centered at (—.6, 1) and (.6, 1), then the catenoid would be a minimal, 
but non-area-minimizing, surface spanning the boundary. Hint: Maple (e.g., “fsolve” and numeric 
integration). 

In fact, more can be shown. Let x9, —xo be the points on the x-axis which are centers of the 
given circles of radius yo. 


e Lif = > .528 (approximately), then the two disks give an absolute minimum for surface 
area. This is the so-called Goldschmidt discontinuous solution. 


° 2. If os < .528 (approximately), then a catenoid is the absolute minimum and the Gold- 
schmidt solution is a local minimum. 


© 3.1f.528 < = < .663 (approximately), then the catenoid is only a local minimum. 


e 4.If oe > .663 (approximately), then there is no catenoid joining the points. This can be seen 
using soap films by forming a catenoid between two rings and slowly pulling the rings apart. 
Measure . and you will see that the catenoid spontaneously jumps to the two disk solution 
at approximately = = .663. For an informal discussion of this problem, see [Ise92]. For a 
formal discussion, see [Bli46]. Finally, for a Maple analysis, see [Opr00]. 


4.4 Constant Mean Curvature 


Now let’s look at situations where mean curvature is nonzero, but constant. Before we begin, we 
present a very beautiful formula which will be the foundation for our study. Let M be a compact 
oriented immersed surface in R? with unit normal U = (x, x x,)/(|x, X X,]), mean curvature 
Hand area A = f f |x, X Xy|du dv. Recall that when we say that M is immersed, we mean that 
we allow self intersections, but the map on tangent planes is still one-to-one. In terms of a patch 
x(u, v) for M, an immersion requires x, and x, to be linearly independent at every point. Thus, 
a nonzero normal x, x xX, is well defined at every point of the surface. Enneper’s surface is an 
example of an immersed surface. 

To continue, just as in our derivation of the minimal surface equation, we perturb M a bit by 
a vector field V on M. Note that we use a vector field here instead of a function g because M is 
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not defined as the graph over some domain. 
M': y'(u, v) = x(u, v) + tV(u, v) 


and we write the area of M’‘ as 


Ge / veewdanay 


= i Vis x xy? + 2t(x, x x,)(Xy x Vy + Vy x Xp) + O(t?) dudv. 


Then, taking the derivative with respect to t and evaluating at t = 0, we obtain 


A’(0) = // rae (x, x V, + Vy x Xy)dudv 


X, X Xy 
-{[= (x, x V, —x, x V,) dudv 


XX Xy| 


= [f MU xx = VU xx dud. 
Exercises 4.4.1. Let P= —V-U x x, and Q=V-U xx ,, apply Green’s Theorem and the 
shape operator formulas of Section 3.1 to get 
[[ Mu xm WU emt VOR xx)dudv = [ V-Uxx,dv+V-U xx, du, 
Cc 
When we integrate over the entire collection of patches on M, the line integrals around the 


boundaries all cancel due to the orientation of M. Hence, over M, the right-hand side above is ~ 
zero and we have 


A(O)= ff Wy-U xx, —Ve-U xx, dudv 


= -ff VQHx, x x,)dudv 


== i 2HU-VdA since, by definition, dA = |x, x x,| dudv. (4) 


Exercise 4.4.2. Suppose that M is a part of a surface bounded by the curve C. Then the line 
integrals above do not cancel on the boundary curve. Hence, if C is parametrized to have unit 
speed, show that 


A= ffrnu-vaa—fv-wxtas 


where T = x, u’ + x, v’ is the unit tangent of C. Further, suppose that the variation vector field 
V is always along the surface in order to keep the boundary curve in the surface. That is, suppose 
that V -U = 0 always. Show that 


A= [vx Tas. 


This exercise will prove important in Exercise 7.6.12. 
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Let’s do two quick applications of formula (**). First, let V = x(u, v) so that y’ = (1 + t)x and 
A(t) = (1 + £)*A since M is simply stretched uniformly. Clearly, A’(0) = 2A so we have 


a=-f HU -xdA. 


Thus we get a formula connecting surface area to mean curvature. We will exploit this below. 
Secondly, let V = fU, anormal vector field. Of course, this is the type of force field we would 
expect to arise from pressure exerted uniformly on a surface. Now, U- U = 1, so we have 
A'(0) = —2 ff fH dA. Also, the geometric interpretation of the divergence of a vector field as 
arate of change of volume Vol (see [MT88, p. 227]) together with the divergence theorem itself 


gives 
Vol’(0) = ff [ows U dx dydz 


ai, fU-UdA 


=) fda. 


(Recall that the divergence (or Gauss’s) theorem says that, for a surface M enclosing a volume 
Q, the volume integral of the divergence div V = 0V!/4x! + 0V7/4x? + 0V3/4x? of a vector 
field V = (V!, V*, V) is equal to the surface integral of the component of V normal to M. 


Formally, 
[[[ svvaa= ff V-UdA 
Q M 


where U is the unit outward normal of M.) So, the problem of minimizing surface area (i.e., 
A’(0) = 0) subject to having a fixed volume (i.e., Vol’(0) = 0) is equivalent to finding H satisfying 


[[ si aaqo for all f with [[ faaqo. 


Exercise 4.4.3, Show that this condition implies H = c, a constant. Hints: (1) Write H = 
c+(H —c)=c+J wherec = HEA (2) Show ff J?dA = f f J(H —c)dA =0. (Use the 
hypothesis.) (3) Then J = 0. (4) Hence, H =c. 


Of course, the surface tension of a bubble works to achieve exactly the situation above. Therefore, 
we have shown that 


Theorem 4.4.4. A soap bubble must always take the form of a surface of constant mean 
curvature. 


But what are these surfaces then? We have seen in Chapter 3 that, in the 1800s, Delaunay 
classified all surfaces of revolution with constant mean curvature. Of these, however, only the 
sphere is compact (i.e., closed and bounded) and embedded in R°*. The question arose whether 
there are any other compact embedded surfaces of constant mean curvature? In other words, must 
non-self-intersecting soap bubbles always be spheres? The answer, originally due to Alexandrov, 
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is yes. To prove Alexandrov’s result we make use of our formula above together with a beautiful 
estimate due to Antonio Ros (see [Ros87]). In these results, notice the essential connections 
between geometry (in the forms of area, volume and curvature) and vector calculus (divergence 
theorem, curvature). Of course, curvature is the link which binds these subjects together. We begin 
by giving Ros’s estimate, then prove Alexandrov’s theorem and, finally, prove Ros’s theorem. 
Ros’s theorem is pretty heavy going, but an understanding of the ideas involved definitely has its 
geometric rewards. (The following is adapted from [Oss90].) 


Theorem 4.4.5 (Ros). Let M be a compact embedded surface in R* bounding a domain D of 
volume Vol. If H > 0 on M, then 


1 
I — dA > 3Vol. 
a 
Furthermore, equality holds if and only if M is the standard sphere. 


Theorem 4.4.6 (Alexandrov). [f M is a compact embedded surface of constant mean curvature, 
then M is a standard sphere. 


Proof. Suppose H is constant and use the area formula above and the divergence theorem to get 
(where U is the interior normal) 


a=—|f nu-xaa 
=H ff [ divedxayds 


= 3H Vol. 
But then we have 
A 
[[qesqaqffea-g-zn3v0 
Vol 
and by Ros’s theorem, M must be a sphere. O 


Proof of Ros’s Theorem. First, let’s fatten M up a bit by taking a shell S on M’s inside given by 
S={pteh(p)U|peM} 
where U is the unit normal of M,0 < € < 1 and h(p) is defined by 


h(p) = sup{r | the point p is the unique nearest point on M to the point g 


at a distance r from p along the normal U}. 


This definition means the following. For any p € M, go in the direction of the inner normal U 
and consider each of the points g along the way. For some of these points q on the line p + 1U, 
p is the nearest point of M to g. Now just focus on these points and take the distance r of the 
farthest such point. (In fact, there is a small technical point here. It is possible that a point g 
which is farthest does not have a unique nearest point on M. Think of the center of the sphere! 
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This is why we take the supremum of the distances of the q’s for which p is the unique nearest 
point.) 

We consider this shell S because we use the volume V of D in our estimate and we will see 
that 


Vol(D) = Vol(S). 


Moreover, the volume of § may be calculated in terms of curvatures in a convenient way. In order 
to see the volume equality, first note that there can be no overlap on the interior of the shell. This 
follows from the definition of A(p) since every point of the shell lies on a unique normal line to 
M. Of course, there may be overlap on the inside boundary of the shell (as the sphere example 
shows), but this will have no effect on volume since the boundary is only 2-dimensional. 

Secondly, let g € D be arbitrary and let d be the distance from g to M (1.e., to M’s nearest 
point). Of course such a distance is obtained along a line perpendicular to M. At any rate, if 
B(q, d) denotes the closed ball of radius d about q, then the interior of B(g, d) contains no points 
of M and the boundary of B(q, d), denoted 3 B(q, d), contains at least one p € M. Consider the 
radius of B(q, d) from q to such a p and take any q’ on this radius. Let r be the distance along the 
radius from q’ to p and note that B(q’, r) lies in the interior of B(g, d) except at p. Therefore, p 
is the unique point of M realizing distance r from q’ to M and all of these q'’s are in the defining 
set for h(p). Now, the supremum of the distances of the q’ is d since the q’ get arbitrarily close 
to qg. Of course, there may be other points in the set further out, but nevertheless, we see that 
d < h(p). This means, by definition of the shell S, that g € S. Thus, every point of D is in the 
shell S$ and the only overlap of S occurs on the 2-dimensional boundary. Therefore, the volumes 
of D and S must be the same. 

Now, if M has a local parametrization x(u, v), then we may parametrize the shell S by 


yu, v,t) = x(u, v) +t U(y, v). 


Note that we need three parameters u, v and t since the shell is 3-dimensional. Here, t varies 
as 0<t <h(p)=A(u, v) where p = x(u, v). Now, a small chunk of S may be thought of 
as a parallelepiped determined by y,, y, and y,. Consequently, the small chunk has volume 
lyu X Yv - yz|. The volume of S is then obtained by integration 


h{u,v) 
vous) = fff Yu X Yo 'Y:|dt dudv. 
0 


To determine the integrand, for the moment assume that the parameter curves x(u, v9) and x(uo, v) 
are lines of curvature. That is, k, x, = S(x,) = —Vx,U = —U, and kz x, = S(x,) = —Vx,U = 
—U,. Then 


y, =x, +tU, =(1—kit)ey, yo =X ttU, = (1 — kat)x,. 


and y, x yy = (1 — kyt)(1 — kot) (x, x x,) = (1 — 2Ht + Kt*)(x, x x,), since kj +k) = 2H 
and kjkj = K. But H and K don't depend on the parametrization chosen. So, even though 
we assumed x, and x, were principal vectors to obtain the formula, in fact the formula must 
hold in general! Of course, y, = U, so this gives us the equality ly, x yy -y,| = |1—2Hr+ 
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K??||x, x x,| and 


h(u,v) 
vous) = fff lvu X Yo ¥:ldt du dv 
0 
A(u,v) 
=/f\f {1 -2H1+ Keath Is x x,|du dv 
0 
A(u,v) 
=} {/ i= 2ne+ Klar] aA 
M 0 


h(u,v) 
=f / (1 — kit) — kyt)idt dA. 
M /0 


From this expression for the volume of the shell we see that the principal curvatures come into 
the scheme of things. In fact, they are related to the function h(p) by the following estimate: 


I 
i) > max{k;(p), k2(p)} 


for all p € M. Let’s see why this is so. Let q be the point a distance h(p) along the normal 
from p. Then the open ball B(g, h(p)) cannot contain any points of M, for if there were a 
p’ € MQ Big, h(p)), its distance to g would be less than A(p) and for all g’ close to g on the 
radius from q to p the distance from q’ to p’ would be less than the distance from q’ to p. This 
contradicts the definition of h(p). 


Exercise 4.4.7. Prove this last statement. Hints:(1) A(p) is a supremum, so there is a sequence 
q’ — q with p the nearest point of M to q’ along a normal, (2) if p’ exists, then the distance of 
q’ to M is less than the distance of q’ to p, (3) since this is true for all g’ close to g, the sequence 
q' > q can’t exist. 


Hence, B(qg, h(p)) contains no points of M and, therefore the sphere that bounds B(q, h(p)), 
S(q, h(p)), lies completely inside the closure of D. In particular, the sphere lies inside M, 
touching at p. This means that S(q, (p)) has larger normal curvature at p in every direction 
than M. (Compare the proof of Theorem 3.5.3 for the opposite estimate.) Of course, the normal 
curvatures on a sphere are constant and equal to the reciprocal of the radius. In our case then, the 
normal curvature is always ne and we have the required inequality. 

Now, the inequality above says that k, h(p) <1 and k,h(p) < 1. Hence, (1 — kit) anc 


(1 — kot) are nonnegative for 0 < t < A(p) and 


h(p) h(p) 
/ |. — kit) — kat)| dt -| (1 —kyt)(1 — kot) dt. 
0 0 


The normal curvature inequality and the geometric-arithmetic mean inequality ab < (ey 


imply 
(-sy) = (aby) (ab) 
A(p) A(p) A(p) 
< (1 — kit) — kat) 
<(1— ary. 
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1 1 
Hence, —— > H(p) or —— > A(p). This gives 
h(p) H(p) 


A(p) B l 
/ (1 —kyt)(1 — kot) dt <[ (1 — Arty dt = — 


and consequently, 


A(p) \ 
Vol(S) =f ff (—kit)(l —kyt)dtdA <[ 3H 74 


as was desired. Finally, note that for equality to hold, it must hold in 
(l—ki — kt) < (1 — Ary. 


Exercise 4.4.8. Show that (1 — k,t)(1 — kot) = (1 — Ht) if and only if k, = k, everywhere. 


Hence, equality holds in the estimate if and only if M is all umbilic. By Theorem 3.5.2, this is 
true if and only if M is a sphere. O 


Results such as Alexandrov’s theorem led H. Hopf to conjecture that all immersed surfaces of 
constant mean curvature are spheres. It has only been in recent years that this conjecture has been 
shown to be false. H. Wente [Wen86] was the first to construct tori of constant mean curvature 
which are immersed in R>. Since Wente’s work, many such have been constructed with important 
consequences for mechanics (for example). We have by no means given an exhaustive study of 
mean curvature. There are many interesting and important questions which remain and we hope 
that the brief outline above will lead the reader to further study. 


4.5 Harmonic Functions 


We now want to begin to discuss some of the connections between minimal surfaces and other 
areas of Mathematics. We will examine these connections more closely in subsequent sections, 
but it is worthwhile to understand, from the outset, that this subject touches, and is touched by, 
many areas of Mathematics and Science. 

A very important partial differential equation studied in mathematical physics is the Laplace 
equation or potential equation. In two-dimensional Cartesian coordinates, the Laplace equation 
is 

af OD ah a 


A@®=— ++ > =0. 

ax? ay? 
(An analogous version of this equation is found in higher dimensions as well as in other 
coordinate systems.) This equation describes, among other things, steady-state temperature 
distribution, equilibrium displacements of a membrane and gravitational and electrostatic 
potentials. 


Definition 4.5.1. A real-valued function ®(x, y) is said to be harmonic if all of its second- 
order partial derivatives are continuous and each point in its domain satisfies the Laplace 
equation. 
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Theorem 4.5.2. If f(z) = u(x, y) +iu(x, y) is complex analytic (i.e., the derivative f'(z) exists 
at all points of a region of the complex plane), then each of u(x, y) and u(x, y) is harmonic. 


Example 4.5.3 (Harmonic Conjugates). 
Let f(z) = 27, where z = x + iy. Then 


a (x+ iy) 
=x? 42ixy — y’ 
= = y?) + i(2xy). 


So, in this case, Re(z”) = u(x, y) = x? — y* and Im(z*) = v(x, y) = 2xy are the particular 
harmonic functions (known as harmonic conjugates) produced from f(z) =z”. To verify 
this, note that the partial derivatives for u are uy = 2x, uy = —2y, ux, =2 and uyy = —2, 
then u satisfies the Laplace equation, ux, + uy, = 2+(—2) = 0. Similarly, the partial deriva- 
tives for v are vy = 2y, vy = 2x, vx, =0 and vy, = 0, so v satisfies the Laplace equation, 
Uxx + Vyy =0+0=0 as well. 


Exercise 4.5.4. Let f(z) =z, where z = x +iy. Calculate the real part of f. The harmonic 
function obtained produces the surface known as the monkey saddle. Calculate the monkey 
saddle’s curvature functions. Graph the monkey saddle. 


It is often useful to visualize what harmonic functions look like in R? in order to discuss some 
of their properties. The graphs of the two harmonic functions in the previous example are the 
standard “saddle” surfaces. It is not always a simple task, though, to recognize the graphs of 
more complicated harmonic functions. This is where our soap film can be used, since an informal 
physical interpretation can be simply made using soap film. However, it cannot be said in general 
that graphs of harmonic functions are minimal surfaces or vice versa. Consider the harmonic 
function u(x, y) = x? — y* from the previous example. Calculating H for the corresponding . 
Monge patch yields 


_ 20+ 4y?)— 2014427) ay? — 4x? 
~ 214+ 4x244y2) 7 1 44x? 4 4y?? 


Clearly, the mean curvature for sucha surface is not identically zero. Similarly, in general, minimal 
surfaces are not graphs of harmonic functions. Very little is known about the parametrizations for 
an arbitrary soap film. Finding explicit parametrizations for minimum area surfaces, even when 
given relatively simple boundaries, has proved to be a nearly impossible task since Plateau’s day. 

A natural question to then ask is, how closely do minimal surfaces approximate harmonic 
functions? First, consider the plane. The function z = ®(x, y) would have the form ax + by +c, 
which is clearly harmonic. (The variable z is not a complex variable here, but is the vertical 
coordinate in Cartesian 3-space.) Now, what if the surface were nearly planar? Using an argument 
similar to the one we used above to show area-minimizing implies minimal, it can be shown that 
in this case the function ® which minimizes area across a loop is harmonic as well. 


Exercise 4.5.5. Suppose z = f(x, y) is a function whose graph spans a given curve and has least 
area among surfaces spanning the curve. Further, suppose all higher powers > 4 of partials of f 
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are negligible. That is, f?, f and f? f? are so small that the integrand ,/1 + f? + f? of the area 
integral is well approximated by Ja + 5Lf2 + #2)? = 1+ 31/2 + £7]. Then, up to a constant 


provided by the term 1 and the factor i, the relevant area integral is 


A =} fp + fp dx dy. 


Now vary z = f(x, y) to get a nearby surface z'(x, y) = f(x, y) +tg(x, y) and carry through the 
derivation of Theorem 4.3.4 as before using the area integral above to show that f is harmonic. 
See [CHS53]. 


Unfortunately in most cases, as the minimal surface becomes less planar, it begins to vary from 
the graph of a true harmonic function. The argument used above breaks down when fourth-order 
terms are no longer negligible. The soap film is less and less a good approximation of the graph 
of a harmonic function. 


Another interesting relationship between minimal surfaces and harmonic functions comes 
about when the surface is parametrized by isothermal coordinates. A parametrization x(u, v) is 
called isothermal if E = G and F = 0. We will show in Section 4.7 that every minimal surface 
has a parametrization with isothermal coordinates. When isothermal parameters are used, there 
is a close tie between the Laplace operator Ax = x,, + Xy, and mean curvature. Recall from 
Formula(s) 3.4.3 that we have the following formulas for an orthogonal coordinate system: 


Ey E, 

Xuu = ap age tly, 
E, G, 

Xyuy = 7B +t age tm, 
G G 

re — 7pm + xg +nu. 


Theorem 4.5.6. Jf the patch x is isothermal, then Ax ef Xuu +X = (QE A)U. 
Proof. Since E = G and F = 0, we have 
+ ae ae +1U)4+ Gis sp te U 
Xu Xoo = | Taku -— DD Ta ies Are n 
“ 2E 2G” ES IG 
E E E, E 
a Tae - pk tl — sexu + eke tnU 
= d +n)U 


l 
a Vay 
2E 
By examining the formula for mean curvature when E = G and F = 0, we see that 
El+En Il-+n 
A =-——— = , 
2E? 2E 
Therefore, X,4 + Xv = (2 E H)U. imi 
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Corollary 4.5.7. A surface M: x(u, v) = (x! (u,v), x?(u, v), x°(u, v)), with isothermal coordi- 
nates is minimal if and only if x', x? and x* are harmonic functions. 


Proof. lf M is minimal, then H = 0 and, by the previous theorem, x,,, + X,y = 0. Therefore, x is 
harmonic. On the other hand, suppose x!, x2, and x? are harmonic functions. Then x is harmonic, 
$O Xyu + Xv = 0 and, by the previous theorem, (2 E H)U = 0. Therefore, since U is the unit 
normal and E = x, - x, 40, then H = 0 and M is minimal. oO 


This result will have a major impact on our discussion of minimal surfaces in the next few 
sections. It is, in fact, the link between the geometry of minimal surface theory and the powerful 
methods of complex analysis. For more on minimal surfaces and their relation to the sciences, see 
[Opr00], as well as the wonderful books [Nit89, DHK W92, Oss86, Boy59, Tho92, HT85, Ise92]. 
In particular, [Nit89, Introduction] and [HT85] provide histories and overviews of the calculus 
of variations, minimal surfaces and the uses of minimal surfaces in science, engineering and 
architecture. Now let’s turn to the link between minimal surfaces and complex analysis. We 
begin by reviewing some notions from complex variable theory. 


4.6 Complex Variables 


At the end of the last section we hinted that there is a close relationship between the subjects of 
minimal surfaces and complex analysis. Now we describe the basics of complex variables which 
are fundamental to this relationship. 

We denote /—1 by i and the field of complex numbers by C = {z =x +iy|x,y ER}. A 
function f: C > C is continuous at zo if lim,.,, f(z) = f (Zo). If this is true for all zo in 
some open set D, then f is continuous in D. The function f is complex differentiable at 
Zo E Cif 


him £6 FO) 


E> 20 P Amer I) 


exists. In this case, the limit is denoted f’(zo). If the limit exists for all zo € D, D open, then 
we say that f is analytic or holomorphic in D. Notice that, although this definition resembles 
the usual single-variable calculus definition of derivative, it is much more subtle since z may 
approach zy from any direction along any kind of path. We can use this subtlety to our advantage, 
however. Because the range of f is C, we can write f(z) = f(x tiy) = (x, y) +iv(x, y) 
where ¢ and 1 are real-valued functions of the two real variables x and y. The function @ is 
the real part of f while y is f’s imaginary part. Now let’s assume the limit above exists and 
compute it in the special case of z = iy > zo = iyo. 


iiaas (Xo, y) tiv (4o, y) — [O(%o. yo) + iv (0, Yo)] 


yy i(y — yo) 


= fim (x0, y) = (xo, yo) a iv(Xo, y) = iv(xo, yo) 
y> Yo i(y — yo) 
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lim (xo, Y) — O(Xo; Yo) rs iv(xo, y) — iv (Xo, Yo) 


y>% i(y — yo) i(y — yo) 
_ lad , aw 
i ay z oy 

dy". 08 
~ ay ay 


Exercise 4.6.1. Show that, when z = x > 29 = Xo, then the limit is oe +i ue 


Of course, if f is complex differentiable at zo, then both of these limits are equal to f’(zo) and, 
hence, to each other. Therefore, we have 


a6 ay 36 av 
ax ay ay dx" 


These are the Cauchy-Riemann equations. In fact, the analysis above may be enhanced to show 
that f is holomorphic on D if and only if oe oe a a exist and are continuous on D and the 
Cauchy-Riemann equations hold. We note here, without proof, that if f is holomorphic, so are 


all of its derivatives f’, f",.... 


Exercise# 4.6.2. Show that f(z) = z* is holomorphic and compute f’(z) from the limit directly 
and from the Cauchy-Riemann equations. 


Exercise 4.6.3. The complex conjugate of z =x +iy is Z = x — iy. Show that f(z) = Z is not 
holomorphic. 


Exercise 4.6.4. The modulus of z = x +iy is |z| = ye? + y? = Jez. If f(z) =u +iv and 
f(z) =u —iv, then we write | f| = /u? + v2 = / f f. Show that a holomorphic function f 
with | f| constant is itself constant. 


Suppose f is holomorphic. The Cauchy-Riemann equations give 


rp ae ddW 4 dy 


ax2 ay? sax By ~— Ay Ox 


_@y #y 
~ axdy dydx 
=0 


since mixed partials are equal. Thus, ¢ and (similarly) w satisfy the Laplace equation An = 0 
(where A = 07/0x? + 87/8y*) and, so, are harmonic functions. Conversely, if ¢ is a twice 
differentiable harmonic function of x, y, then on some open set there is another harmonic function 
w such that f = ¢ + iy is holomorphic. Harmonic functions ¢ and w which give such an f are 
said to be harmonic conjugates. From now on, for convenience, we shall use the notation ¢, for 
the partial derivative with respect to x, etc. 
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Exercise 4.6.5. Let @ = x? — y* and find its harmonic conjugate y as follows. From Cauchy- 
Riemann, ¢, = py, so integrate ¢, with respect to y to get Y up to a function of x alone. Now 
use dy = —w, to determine this function. Are you surprised at your result? 


Integration of complex functions is accomplished through the usual line integrals of vector 
calculus. Suppose f = @ + iw is continuous and y(t): [a,b] > C is a curve. Then we define 
the integral of f along y to be 


/ f= [ ” Fryar. 


Exercise 4.6.6. Show that we have 
[3 = [ os, yas = vex »ddy +i f dx, dy + UO ya, 
Yy Yy Y 


where the integrals on the right-hand side are real variable line integrals. 


Exercise 4.6.7. Suppose f is holomorphic with continuous derivative on and inside a closed 
curve y. Use Green’s theorem on each integral in Exercise 4.6.6 to show that 5 f =0. 
This is a weak version of Cauchy’s theorem. Show that this implies that integrals of holo- 
morphic functions only depend on the endpoints and not on the paths chosen over which to 
integrate. 


The most important thing for us to remember is that there is a Fundamental Theorem of 
Calculus for complex integrals. Namely, if f is holomorphic, then 


/ f= f)— fla). 
Y 


Therefore, many of the formulas from ordinary calculus carry over into complex analysis. 
With a view to the future, when we shall consider a patch x(u, v) with complex coordi- 
nates, we write z = u +iv, Z = u — iv and introduce the following notation for complex partial 


differentiation: 
a lfa a a 1 fa eG a 
—=-(>--i-—}, —=={[—+i-—|]. 
az 2\au dv a2 2\au av 
One advantage of this notation is that it provides an easy test for f to be holomorphic. 
Exercises 4.6.8. Show that f is holomorphic if and only if of = 0. 
Exercise 4.6.9. Show that 


le a (a 
af © f+ fn =4 (5 (=). 


4.7 |sothermal Coordinates 


As we saw in Section 4.5, the key to introducing complex analysis into minimal surface theory 
is the existence of isothermal coordinates on a minimal surface. Recall that a patch x(u, v) is 
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isothermal if E = x, -X, = X,-X, = G and F = 0. In fact, isothermal coordinates exist on all 
surfaces, but the proof is much harder than the one given below (due to Osserman [Oss86]) for 
minimal surfaces. 

Theorem 4.7.1. /sothermal coordinates exist on any minimal surface M C R°. 

Proof, Fix a point m € M. Choose a coordinate system for R? so that m is the origin, the tangent 


plane to M, T,,M, is the xy-plane, and near m, M is the graph of a function z = f(x, y). 
Furthermore, the quotient and chain rules give 


72 Re 
(=) - (44) oD ip Aap aIp ity thd Dl 


w w w 


Ww Ww 


Ith : : 
(+4) - (42) =- Epps D- 2h Soho + fll + 2] 
% y 


where w = ,/1 + f? + f?. As is traditional (and convenient), let p = f,,q = f,. Because M is 
minimal, f satisfies the minimal surface equation 


fex(l + f2)—2fc fy fry + fy + f2)=0 


so we have 


(=) -(2),-0 and (22) - (4) =o 


Define two vector fields in the xy-plane by 


1 2 1 2 
pe (24, 4) se egg! (22, =f) 
Ww WwW W W 


and apply Green’s Theorem to any closed curve C contained in a simply connected region R to 


obtain 
1 2 
[v= ff (9, -(E2) aray=o 
Cc R\W/x WwW y 
1 2 
[w-ff (=*) - (77) dx dy =0. 
Cc R w x Wy 


Since the line integrals are zero for all closed curves in R, V and W must have potential functions 
(see [MT88]). That is, there exist « and p with grad(uw) = V and grad(p) = W. (Here we use 
“grad” to stand for the gradient to avoid confusion with the covariant derivative.) Considered 
coordinate-wise, these equations imply n, = Es ag by = Pa and p, = “4, py= Leg" Define a 


Ww 
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mapping T: R — R? by 
T(x, y) = +H, y), ¥ + OX, y))- 
The Jacobian matrix of this mapping is then 


2 
NT) = I+px By | _ 1442 ae 
Px 1+py Pa 1442 


WwW 


and we calculate the determinant to be det J(T) = (1 + w)*/w > 0. The Inverse Function The- 
orem then says that, near m = (0, 0), there is a smooth inverse function T~!(u, v) = (x, y) with 
Tj=s(T)' = 


l+q? 2 
er ee ae Pee ee fe Ca —Pq 
detJ(T)| -8 1447) (1+w?L —pq w+l+p? 


Yu yu 


by the definition of the Jacobian. We will put these calculations to use in showing that the 
following patch (in the uv coordinates described above) 


x(u, v) & (x(u, v), y(u, v), f(x(u, v), y(u, v))) 


is isothermal. First we calculate 
w+l+q? -—pq w+l+q —pq 
ty = 7? aE ae hat 
(1 + w) (1 + w) (1+ w) (1+ w) 


E =x, +X, 


and 


1 
= as oy lt +14 97S 4+ pq? t p(wtlt@’y —2p’q?(w+1+q’)+ p’q'] 


ee 
~ (1+ wy 


w 


~ (+0 


[d+ wy(1+q? + p’)] 


Exercise 4.7.2. Show that G = x, -x, = E and F = 0 by computing 
( —pq wt+l+p? —pq ) (tr )) 
x= | —— = SY - 
(lt+wy’ (l+w) (1+ w) (1+ wy 
Hence, the patch x(u, v) is isothermal. Oo 


Exercise 4.7.3. If M: x(u, v) is a surface with isothermal parametrization, show that the formula 
for mean curvature reduces to H = in Hence, for M minimal, / = —n. 
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4.8 The Weierstrass-Enneper Representations 


From now on, unless stated otherwise, we shall take M to be a minimal surface described by an 
isothermal patch x(u, v). We let z = u + iv denote the corresponding complex coordinate and 
recall Z = (z —iZ). Since u = ae and v = 2 we may write 


x(z,2)=@' (2. 2),2"@.2)0 2) 


LG regard x'(z, z) as a complex-valued function which happens to take real values and we have 


ae = mes: — ix!). Define 


def OX 


soa aCe eo) 


Let’s examine ¢ a bit more closely. We shall use the following notation: (6)? = (x)? + (x2)? + 
(x3) and ||? = |x} ]* + |x2/? + |x3|?, where |z| = /u? + v? is the modulus of z. First, note that 
we have (x!)? = 4((x/)? — (ai)? — 2ixix!). Therefore, 


3 
(gy = ; Gly - su! yo2r Da 
j=l 


j=l 


= (lsat — |x, |? — 2x, - x) 


1 
= {(E - G-2iF) 


=0 


since x(u, v) is isothermal. By comparing real and imaginary parts, we see that the converse is 
true as well. Namely, if (¢)* = 0, then the patch must be isothermal. 


E 
Exercise 4.8.1. Show that |¢|* = = #0. 


Finally, $¢ = (2) = Ax =0 since x is isothermal. Therefore, each ' = 2 is holomorphic. 
Cencua the: same calculation shows that, if each ¢! is holomorphic, then each x! is harmonic 
and, therefore, M minimal. All of these observations together give 


Theorem 4.8.2. Suppose M is a surface with patch x. Let @ = 2 and suppose (oy = 0 (i.e., x 
is isothermal). Then M is minimal if and only if each ¢' is holemorphic 


In case each ¢' is holomorphic we say that ¢ is holomorphic. The result above says that any 
minimal surface may be described near each of its points by a triple of holomorphic functions 
¢ = (¢', 6, ¢°) with (¢)* = 0. Indeed, in this case we may construct an isothermal patch for a 
minimal surface by taking 


Corollary 4.8.3. x!(z, Z) = c; +2Re f ¢! dz. 
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Proof. sions Z=utiv, we may wmite dz = du + idv. (This is “differential” shorthand for 


dz du 
a ey je :.) Then 


b' dz = = [(xi — ixi)(du + idv)] = 5k du+xidv+i(xidv—xi du)], 


eS 1 : . | eae . . . 
gdi= 5 [(x), + ix) (du — idv)| = [xi du +x), dv —i(x), dv — xi du)| : 


We then have dx! = dz + dz = ¢' dz + ¢' dz = 2Re@¢ dz and we can now integrate to 
get x!. oO 


So, in a real sense, the problem of constructing minimal surfaces reduces to finding ¢ = 
(¢', ¢7, d°) with (¢)? = 0. A nice way of constructing such a @ is to take a holomorphic function 
f and a meromorphic function g (with fg* holomorphic) and form 


1 i 
g=5fl-8), b =sf(l+e’), & = fe. 
Exercise 4.8.4. Show that this @ satisfies (@)? = 0. 


¢ 


Exercise 4.8.5. Show that g = ———_. 
g —id 


A function g is meromorphic if all its singularities are poles. That is, around each singularity zo 
there is a Laurent expansion (generalizing the Taylor expansion) of the form 


b b 
B23 ae - _ j 
8(z) Cae ghia ee Cae + 2 aj;(z — 20) 


for some finite n with coefficients determined by g. For us, the most important examples of 
meromorphic functions are rational functions g(z) = P(z)Q(z) for polynomials P, Q. Therefore, 
we obtain 


Theorem 4.8.6 (Weierstrass-Enneper Representation I). Jf f is holomorphic on a domain D, 
g is meromorphic on D and fg” is holomorphic on D, then a minimal surface is defined by 
x(z, Z) = (x!(z, 2), x7(z, 2), x°(z, 2)), where 

x@p=Re ff s%)dz 

2,22 Re fi fc + g’)dz 


x3(z,Z) = Re2} fedcz. 


Before we look at some examples, it is worthwhile noting another form of the Weierstrass- 
Enneper Representation. Suppose g is holomorphic and has an inverse function g~! in a domain 
D which is holomorphic as well. Then we can consider g as a new complex variable t = g with 
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dt = g’dz. Define F(t) = f/g' and obtain F(r)dt = f dz. Therefore, if we replace g by t and 
f dz by F(t)dt, we get 


Theorem 4.8.7 (Weierstrass-Enneper Representation II). For any holomorphic function F(t), a 
minimal surface is defined by the parametrization x(z, Z) = (x'(z, z), x2(z, Z), 27(z, 2)), where 


x'(z,Z) =Re fo —1*)F(t)dt 
(2,2 = Re f i +1°)F(r)dt 


x3(z,Z) =Re 2f tF(t)dt. 
Note the corresponding 


¢= (5c — 1’) F(t), tl + ’)F(t), rF(s)) 


This representation tells us that any holomorphic function F(t) defines a minimal surface! Of 
course, we can’t expect every function to give complex integrals which may be evaluated to nice 
formulas. Nevertheless, we shall see that we may calculate much information about a minimal 
surface directly from its representions. In order to look at some of our standard minimal surfaces 
from the representation viewpoint, we need to recall some of the basic functions of complex 
analysis. For this, write z = u +i v and define 


e =e"(cosu+i sinv) and log(z) =Invu2 +02 +i arctan(~) . 
u 


We haven’t been precise here about branches of the log, but this technicality will not concern us. 
Using the definition of e?, we may define 


ez = et? ez + et 
sin Zz = ————— cos Z = —————. 
2i 2 
. e = e% e + e7% 
sinh z = coshz = 
2 2 


One reason these definitions are chosen is that they extend the usual real functions of the same 
name. For example, if z =u, then the definition of e* gives sinz as the real function sinw. 
Similarly, for z =u, sinhz = sinhu. While these complex functions have exactly the same 
differentiation, integration, sum and difference rules as their real cousins, it is often useful to 
expand the complex functions into their real and imaginary parts. To accomplish this, we replace 
z by uw +i v and use the definition of e% above. 
Exercise 4.8.8. Derive the following formulas: 

(1) sinz = sinucoshv +i cosusinhv. 

(2) cosz =cosucoshv +i sinu sinh v. 


(3) sinhz = sinhu cosv +i coshu sinv. 


(4) coshz = coshucosv +i sinhu sin v. 
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Example 4.8.9 (WE Representation of the Catenoid). 


Let F(t) = =15. Then, using the substitution t = e¢ below, we obtain 
2t g 


1 1 1 
sere fU-)25at Pare [+r yr at vaRe2 fr oodr 
2r? 2r2 T 


1 1 fol 1 l 
=Re f 5 -5at =Ref i] +5] dr =Re f 2 at 


I 1 
= —Re [=+5| =-Rei| = - 5] =Relnt 


2t 2 2r 2 

=2 z FE ones 4 z 
oie — = Rei = = Rez 
= —Re coshz = Rei sinhz =U. 
= —coshu cos v = —coshu sinv 


We obtain (without the negative signs) the standard parametrization of a catenoid, x(u, v) = 
(cosh u cos v, cosh u sin v, “). 


Exercises 4.8.10. Let F(t) = sit. Show that the representation associated to F(t) is a helicoid. 
Exercise 4.8.11. Show that the catenoid and helicoid, respectively, also have representations of — 
the form 


(f,g)= (->.-«) and (f.g)= (-: =e). 


Exercise 4.8.12. Show that the helicoid has another representation (f, g) = -§, 1). This rep- 
resentation seems simpler than the first two, but there is a small problem. Namely, the inte- 
grals of the representation are not path independent. They are then said to have real periods. 
Show that if we carry out the integration for x? (of the representation above) about the en- 
tire unit circle, we obtain a real period equal to 27. To fix this ambiguity, we take a smaller 
domain C/{0 UR7}. We therefore obtain one loop of the helicoid from this representation. 
Other loops may be similarly obtained and fitted together to create the whole surface. Thus, 
the simplicity of the representation (f, g) = (-$, 1) hides the subtlety underlying complex 
integration. 


Exercise 4.8.13. Let F(t) =i (4 - 4). Show that the associated representation is Catalan’s 
surface 


x(u, v) = (u —sinucoshv, ] —cosucoshv, 4sin = sinh =) ; 


Hint: after integrating, replace t by e~'*/? and use the expansion of sin z. 
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Exercise 4.8.14. Let F(t) =1—- +. Show that the associated representation is Henneberg’s 
surface 


2 2 
X(u,v) = (2 sinh u cos v — 3 sinh 3u cos 3v, 2 sinhu sinv + 3 sinh 3u sin 3v, 2 cosh 2u cos 2). 


Let v = § and show that you obtain Neil’s parabola (z — 2)? = 9x?. Hint: for the first part, after 
integrating, replace t by e*, use the expansions of sin z and cos z and replace y by —y. 


Exercise 4.8.15. Let F(t) = 1 (or, equivalently, (f, g) = (1, z). Show that the Weierstrass- 
Enneper representation gives you Enneper’s surface. If (f, g) = (1, z”), then an n-th-order En- 
neper surface is obtained. Calculate the x(u, v) for the 2nd-order Enneper surface. Hint: Maple 
might be useful here. 


Exercise 4.8.16. Let (f, g) = (2’, 4). Calculate the x(u, v) given by the Weierstrass-Enneper 
representation. This surface is called Richmond ’s surface. 


Exercise 4.8.17. Let F(t) = wa. Show that the Weierstrass-Enneper representation gives 
Scherk’s surface z = In(*). Hints: (1) for x!, use partial fractions to get 


cos x 


2 
[mr a5 dt =i log. —it) —¢ log(] + it), 


(2) since the log’s are multiplied by i and you want the real part, the definition of log(z) 
gives x! = arctan(;*;) — arctan(7=>) (ie, 1+it=1—v+iu and 1—it=1+v-—iu), 
(3) the formula for the tangent of a difference of two angles then gives 


! — arctan Be 
YS ——— }, 
{-—(u? + v?) 


? — arctan ad 
a + }, 
1 —(u? + v?) 


3 2 (u? — v? +1)? + 4u?v 
x? = Re(log(r? + 1)— log(r? — 1)) = zim In poe, 
(6) by drawing appropriate right triangles, write cos x? and cos x? in terms of u and v and show 
x? = In(cos x*/cos x'), Note that the integrals have real periods so that, just as for the helicoid, 
we can take one piece at a time to build up the whole surface. 


(4) similarly, 


(5) also, 


While we have seen that the Weierstrass-Enneper representations give us our standard array 
of minimal surfaces, their true value becomes apparent when we realize that we can analyze 
many facets of minimal surfaces directly from the representing functions (f, g) and F(t). In 
particular, this applies even to surfaces whose Weierstrass-Enneper integrals may not be explicitly 
computable. As an example of this, we shall compute the Gaussian curvature K of a minimal 
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surface in terms of F(t). First note that isothermal parameters give 


«= 7799 (55 (yee) +m (Ze) 
2VEG \ av EG du EG 
=-3($(F)+z(¢)) 

2E \av\ E Ou\E 

2E \ av? du? 


1 
= ——A(In E), 
OE (In E) 


where A is the Laplace operator x + S. By Exercise 4.8.1, we know that E = 2!@|’, so we 
compute E for ¢ = ($(1 — t?)F(t), $(1 + t?)F(r), t F(r)). We have 


B= 2/150 - )F ef +150 + FOr + eFC? | 


1 
= 5lFr [Ir? — 1) +(e? + 17 + 4)r/7]. 
Now, t? =u? —v? 4+ 2iuv, so [t? — 1]? = (uv? — v* — 1)° +. 4u*v*. Similarly, |t? + 1|* = 
(u2 — v? +1)? + 4u?v? and 4|t|? = 4(u? + v?). Then 
1 

E= 5IFI'2 [Ww yl £14470? 42 + 2v"| P 

=|F/* [u* + 2u?v? +4414 2024+ 2v"| 

=|FP[l+w4v). 


Exercise 4.8.18. Show directly from the definition of ¢ in terms of the representation (f, g) that 
2 
E=(fP[i +s’. 


Now, In E = In|F|* + 2In(1 + u? + v?) and it is an easy task to prove the following. 


8 
Exercise 4.8.19. Show that A(2 In(1 + uw? + v*)) = Trea 

We must now calculate A(In|F|?) = A(n FF) = A(in F + In F). Previously, we have seen 
that A = 487/8Z4z. Further, because F is holomorphic, F cannot be, so F/8z = 0 and, con- 
sequently, 9 In F/dz = 0. We are left with A(In F) = 48? In F/820z = 48(F'/F)/8Z = 0 since 
F, F’ and, hence, F’/F are holomorphic. Thus, A(In|F |?) = 0 and A(in E) = 8/(1 + u? + v’. 


Theorem 4.8.20. The Gaussian curvature of the minimal surface determined by the Weierstrass- 
Enneper representation II is 


-4 
K = ————_.. 
|F/?Q. + u2 + v2)4 
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Proof. From the calculations above, 


i 
C2 Rie 
ope) 


—8 
 2)F PC + u2 + v2) 
_ -4 
~ [FRC + v2 + v2)" 


Oo 


4 112 
Exercise 4.8.21. Use the identification F = £ to derive the formula K = Trans, in 
& 
terms of the Weierstrass-Enneper representation I. 


Exercise 4.8.22. Explain the apparent discrepancy between the two formulas for K . That is, the 
first formula never allows K = 0 while the second has K = 0 at points where g’ = 0. Hint: what 
was our assumption about g which allowed the transformation from Weierstrass-Enneper I to 
Weierstrass-Enneper II. 


Exercise 4.8.23. Recall that a point is umbilic if the two principal curvatures at the point are 
equal. Show that umbilic points on a minimal surface with representation (f, g) correspond to 
the zeros of g’. Hence, umbilic points on a minimal surface are flat. 


Exercise 4.8.24, A minimal surface described by (f, g) or F(t) has an associated family of 
minimal surfaces given by (respectively) (e’’ f, g) ore! F(t). Two surfaces of the family described 
by fo and f; are said to be adjoint if t; — t) = }. Show that all the surfaces of an associated family 
are locally isometric. Hints: (1) since we have isothermal coordinates, it is enough to show that 
E remains the same no matter what t is taken and (2) |e’’| = 1 forall t. 


Exercise 4.8.25, The catenoid has a representation (f, g) = (—e-*/2, —e*). Write the x(u, v) for 
its associated family and its adjoint surface. 


Exercise 4.8.26, Find the adjoint surface to Henneberg’s surface with F(t) = —i(1 — 4). Set 
u = 0 and show that the resulting v-parameter curve is the astroid x7/? + y?/? = (8/3)*/3. This 
astroid is a geodesic. 


So far, we have treated the Weierstrass-Enneper representation more from an algebraic and 
analytic point of view than from a geometric one. But, after all, it is the geometry of minimal 
surfaces which interests us, so it is reasonable at this point to connect the discussion above with 
our usual differential geometric constructs. For this, recall from Chapter 2 that the Gauss map of a 
surface M : x(u, v) isa mapping from the surface to the unit sphere S?, denoted G: M — S? and 
given by G(p) = U,, where U, is the unit normal to M at p. In terms of the parametrization, we 
may write G(x(u, v)) = U(u, v) and, for a small piece of M, think of U(u, v) as a parametrization 
of the sphere S*. Recall also that the induced linear transformation of tangent planes is given, 
for the basis {x,, xy}, by G.(x,) = U, = —S(x,) and G,({x,) = U, = —S(x,). A mapping of 
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surfaces 1: M — N given in terms of parametrizations (with the same parameters u and v) 
I(x(u, v)) = y(u, v) is a conformal map if 


Ey = Mu, v) Ey = A, vy Iau) + Le(Ku) 
Fy = Au, vy Fy = A(u, vy? Ie(%u) oy La(Xv) 
Gy =u, vy Gy = Alu, vy L(Xy) + Le (Xv) 


for a function A(u, v) called the scaling factor. Equivalently, as done below, we can simply show 
the equalities using lengths |x,,| and |x,|. / is conformal exactly when the metrics are proportional 
at each point p € M and its image I(p) € N. We shall study this notion more extensively in 
Section 5.5. 


Exercise 4.8.27. Consider the Gauss map ofa surface M: x(u, v) C R? defined by G(x(u, v)) = 
U(u, v), where U is the unit normal of M. For the helicoid x(u, v) = (ucosv, usin, v), the 
catenoid x(u, v) = (coshu cos v, coshu sin v, uv) and Enneper’s surface x(u, v) = (u — u?/3 + 
uv’, —v + v?/3 — vu?, u* — v*), show that the Gauss maps are conformal with respective scaling 
factors A(u, v) = 1+u?, cosh? u, (1+ u? + v2)°/2. 


In fact, we have the following general result. In Exercise 4.8.30, we see that this result geomet- 
rically characterizes minimal surfaces and spheres. 


Proposition 4.8.28. Let M: x(u, v) be a minimal surface parametrized by isothermal coordi- 
nates. Then the Gauss map of M is a conformal map. 


Proof. In order to show G to be conformal, we need only show |G,(x.)] = e(4, v)|Xu],1G.(X»)| = 
p(u, v)|x,| and G,(x,)- G.(Xy) = o?(u, v) x, - X,y for a scaling factor p(w, v). Since isothermal 
coordinates have E = G and F = 0, we get 


m m 
G,(x,) = U, = —_™ = EY , G(x») ae U, aes ze — EO 


where we have used the formulas for U,, and U, developed in Formula(s) 3.4.3. Taking dot 
products gives 


1 l 
2 2 2 2 2 2 
|Uu| =< +m’), |U,| = zl +n?] 
m 
Us Uo= = [+a]. 
But, by Exercise 4.7.3, / = —n, so we obtain 


I 
[UP = 5 [P +m?) =1U,? and U,-Uy =0. 


Since |x, | = /E = |x,| and x, - x, = 0, we see that the Gauss map G is conformal with confor- 


mality factor //* + m2/E. O 


Exercises 4.8.29, Show that the conformality factor //2 + m2/E is equal to ./|K|, where K is 
the Gauss curvature. 


4.8. The Weierstrass-Enneper Representations 195 


Exercises 4.8.30. Suppose M: x(u, v) is a surface whose Gauss map G: M > S? is conformal. 
Show that either M is (part of) a sphere or M is a minimal surface. For simplicity, assume that 
the patch x(u, v) is orthogonal (but not necessarily isothermal). 


Exercise 4.8.31. Show that a non-planar minimal surface M: x(u,v) cannot have constant 
Gaussian curvature. Hint: (1) if so, then define M: y(u, v) = ./[K|x(u, v) and show that M has 
constant Gauss curvature —] as well as the same unit normal as M. (2) Show the Gauss map of 
M isa local isometry. (3) Why is this a contradiction? 


Stereographic projection from the North pole is denoted by St: S*/{N} — R? and is defined 
by 


; ; cosucosv sinucosu 
St(cos u cos v, sin“ cos v, sinv) = {| ————_, ——_—_ 


l—sinv’ I|—sinv’ 

This is obtained by drawing a line through the North pole and a point on the sphere S* and 
then projecting that point to the point in R? where the line intersects. We will see in Chapter 5 
that stereographic projection is conformal with conformality factor 1/(1 — sin v). In cartesian 
coordinates, stereographic projection is simply St(x, y, z) = (x/(1 — z), y/(1 — z), 0). We may 
identify R? with C and extend St to a one-to-one onto mapping St: S* > CU {oo} with the 
North pole mapping to oo. With these identifications, we have 


Theorem 4.8.32. Let M: x(u,v) be a minimal surface in isothermal coordinates with 
Weierstrass-Enneper representation (f, g). Then the Gauss map of M, G: M > CU {oo}, may 
be identified with the meromorphic function g. 


Proof. The proof is a long calculation which brings together all of our complex analytic ingredi- 
ents. Recall that ¢ = #, 6 = 3 and 


az? 
o=sf(l—8) @=sflte) & = fe. 


We will describe the Gauss map in terms of ¢!, ¢? and ¢°. First, we write x, x x, = ((x, x 


Xy)!, (Xe X Xv), (Ku X Xp?) = (4243 — x3x?, x3) — x3, x}x? — x2x!). Let’s consider the 


first component (x, x x,)! = x2x3 — x3x?. We have 
= x3x? = Imf[(x? — ix2)(x3 + ix3)] 

= Im[2(4x?/dz) - 2(8x7/8Z)] 

= 41m(¢’¢"). 
Similarly, (x, x x,)? = 4Im(¢°¢') and (x, x x»)? = 41m(¢!'¢?). Hence, 

XX Xy = 41m(9’¢?, $°G', $'6") = 2 x 4), 
where the last equality follows from z — z = 2 Im z. Now, since x(u, v) is isothermal, |x, x x,| = 
Ix,}- [xy] = |x,/? = E = 2|@|?. Therefore, 
JEN OX) AXP 
[Xu X Xo] 21¢/? Ip? 


196 4. Constant Mean Curvature Surfaces 


We now compute the Gauss map G: M > CU {oo}. 


G(x(u, v)) = St(U(u, v)) 


(Pines ee 96) 

= St 

lel? 

(= __2Im@*!) ) 
~ \\@l? — 21m(¢'¢?)" |6|? — 2Im@!¢?)' ] 


The last equality follows since 


x _ 2Im(¢¢°) 1 
Sa. — bles zim e2y 
ews |d| 1 ~ ee 
= 2 Im(¢¢°) ; |p|? : 
lp? |p? — 2Im(¢'¢?) 
2Im(¢$’) 


~ |e? — 2Im@'$?) 
and similarly for y/(1 — z). Identifying (x, y) € R? with x + iy € C allows us to write 
2 Im(¢?$°) + 2i Im(¢°¢') 
Io? —2Im(@'¢?) 


Now, let’s consider the numerator NV of this fraction. 


N =2Im(¢¢°) + 2 Im(¢°4") 


G(x(u, v)) = 


= “0s —¢¢+i¢'¢! -1¢°¢'] 
=¢°¢'+i¢)-¢@' +i¢°). 
Also, 0 = (¢" = (¢'P +(¢’P + (HY = (G! — ig?! +167) + (6), 80 


; -(¢y 
SR giange 
Then we have, 
issn oo 
N=6O +P) 48 
_ PIG! -i¢?)G' +i ¢) + 1677) 
se ¢} -i¢? 


3 
Tt ar [oP + ie? + le? + iGo! — 976") 


3 
= 52g lio? - 21mo'#)]. 
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Hence, the second factor of the numerator VV cancels the denominator of G(x(u, v)) and we end 
up with 
¢ 
G(x(u, v)) = gee 


By Exercise 4.8.5, we know that g = som as well, so we are done. O 


Using the Weierstrass-Enneper representation II, we see that the Gauss map may be identified 
with the complex variable t as well. 


Exercise 4.8.33. A point p = x(uo, vo) is a pole of the Gauss map G: M > CU ovoif G(p) = 
oo. By the definition of stereographic projection, p is a pole of G if and only if U(p) = (0, 0, 1), 
the North pole. Verify this by showing that U(p) = (0, 0, 1) if and only if the denominator of the 
following expression for G vanishes: 


_ 2im(g?6*) + 21 Im(9°9") 
ORG San otaiglg 3 


Exercise 4.8.34. Prove Bernstein's Theorem: \f a minimal surface M: z = f(x, y) is defined on 
the whole xy-plane, then M is a plane. Hints: (1) In the proof of the existence of isothermal 
coordinates, if the parameter domain is the whole plane, then the potential functions yu and p 
can be extended over the plane as well. The mapping 7 then becomes a diffeomorphism (i.e., 
a smooth one-to-one onto map with smooth inverse) between the xy- and uv-planes. Therefore, 
we may assume that M has parameter domain the whole uv-plane where u and v are isothermal 
coordinates. (2) The normals for M are contained in a hemisphere. Rotate the sphere to get 
them in the lower hemisphere. (3) Think of the uv-plane as the complex plane C and look at the 
composition C > M S 9 /{N} — C. Why is this map holomorphic? Think of g. (4) Liouville’s 
Theorem in complex analysis says that a complex function defined on the entire complex plane 
which is both bounded and holomorphic is constant. 


4.9 Maple and Minimal Surfaces 


In this section, we will show how Maple can be used to plot minimal surfaces and to discover 
them as solutions to differential equations. 


4.9.1 Minimal Surface Plots 


First, let’s see how our standard minimal surfaces arise. See Exercise 4.2.8 for the actual pictures 
of some of the minimal surfaces we consider. Others have been seen throughout earlier parts of 
the book. Below, we list the parametrization of the surface, compute the mean curvature (using 
the “MK” procedure of Chapter 3) and give a plot command for the surface. It is left to the reader 
to carry out these plot commands. 


> catenoid:=<ulcosh(u)*cos(v) |cosh(u)*sin(v)>; 


catenoid := [u, cosh(u) cos(v), cosh(z) sin(v)] 
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> MK(catenoid); 
0 


> plot3d(catenoid,u=-1..1,v=0..2*Pi,shading=zhue,scaling = 
constrained, orientation=([0, 163] , lightmodel=light3) ; 


> helicoid:=<u*cos(v) |u*sin(v) |v>; 
helicoid := [u cos(v), usin(v), v] 
> MK(helicoid); 
0 


> plot3d(helicoid,u=0..1.5,v=0..5*Pi,shading=xy, 
orientation=[21,64] ,lightmodel=light3, grid=[10,60]); 


> gcherki:=<ulvlln(cos(v)/cos(u))>; 
scherk] := \. v, In (ey) 
cos(u) 


0 


> plot3d(scherki ,u=-1.57..1.57,v=-1.57..1.57,shading=xy, 
lightmodel=light4, orientation=(51,66] ,grid=[20,20]); 


> enneper:=<u-u°3/3+u¥v*2|-vtv73/3-veu"2|u72-v"2>; 


> MK(scherk1); 


1 1 
enneper := E = rl +uv’, —ut+ av _ vu’, uw? -+ 


> MK(enneper) ; 
0 


> plot3d(enneper ,u=-2.2..2.2,v=-2.2..2.2,shading=zhue, 
scaling=constrained, orientation=[94,42]); 
> catsurf:=<u~-sin(u)*cosh(v) |1-cos(u) *cosh(v) |4*sin(u/2) * 
sinh(v/2)>; 
uy, v 
catsurf := « — sin(u) cosh(v), | — cos(u) cosh(v), 4 sin (5) sinh (5)] 


The following command shows the limits of Maple’s simplification abilities. If a semi-colon is 
used instead of a colon, then Maple outputs a huge mess which does not appear to allow easy 
simplification. Nevertheless, Catalan’s surface is minimal. 


> MK(catsurf): 


> plot3d(catsurf ,u=0..4*Pi,v=-2.3..2.3,scaling=constrained, 
shading=zhue, lightmodel=light3, grid=[50,15] ,orientation= 
[-52,42]); 


The following procedure gives the evolution depicted in Figure 4.4. The “display” com- 
mand that follows gives an animation of the isometric deformation of the helicoid into the 
catenoid. 
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> helcatplot := proc(t) 

local X; 

X := <cos(Pi*t)*sinh(v)*sin(u)+sin(Pi*t) *cosh(v) *cos (uw) | 
~cos(Pi*t) *sinh(v) *cos(u)+sin(Pi*t) *cosh(v) *sin(w) | 

u*cos (Pitt) +v*sin(Pitt)>; 
plot3d(X,u=0..2*Pi,v=-2..2,scaling=unconstrained, orientation 
=[(46,52] ,shading=zhue, grid=[25,12] , lightmodel=light2) ; 

end: 


> display3d(seq(helcatplot(t/40) ,t=0..80) , insequence=true) ; 


4.9.2 The Minimal Surface Equation 


Now let’s see how Maple obtains the minimal surface equation. We take a Monge parametrization, 
compute its mean curvature by “MK” and set the numerator to zero. 


> monge:=<ulvlf(u,v)>; 


monge := [u, v, f(u, v)] 


> MK(monge) ; 

1(/ # a a 2 (#2 
5 (€ f(u, ») + (s5 f(u, ») (> f(u, ») + (a f(u, ») + 

a? a : a a a? 

(so f(u, ) (> f(u, ) —2 (> f(u, ») (> f(u, ») (sa f(u, »)) 
2 2\ 3/2) 
/( + (- f(u, ») + (> f(u, ) 
Ou du 


> factor (numer (MK (monge) ) )=0; 


a? a a 2 a? 
(so f(u, ») + (sa f(u, ») (> f(u, ») + (Ga fu, ») ae 


Be a é a a a? 
( f(u, ) (~ f(u, ) —2 (~ f(u, ») (> f(u, ») (sa, f(u, ») 
=0 


This is the minimal surface equation. 


4.9.3 A Geometric Condition: Minimal Surfaces of Revolution 


Let’s consider the situation of a surface of revolution with H = 0 from the Maple point of view. 


> surfrev:=<ulh(u)*cos(v) |h(u)*sin(v)>; 
surfrev := [u, h(u)cos(v), h(z) sin(v)] 
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We can take the mean curvature using the MK procedure above. 


> mean:=MK(surfrev) ; 


1 h(w) (4 h(u)) — 1 — (4 hwy? 
2 hw) (1+ (£2 hw)?)8/) 


mean -= 


We have H = 0, so we take the numerator of “mean” and solve the differential equation obtained 
when we set the numerator equal to zero. 


> numer (mean) =0; 
hwy h(u I (i h(u 7 =( 


> dsolve(numer(mean)=0,h(u)); 
1 
(e4 CD)? (e(-C2-Chy2 
C1 : 


11+ (e Dye (eC cy 
h(u) = = 
2 elt -CD ef-C2-Cl) _C] 


5 aes ) elt CD) ef-C2-Cl) 


h(u) = 5 


We get two answers because we need square roots to get u. We can take one answer as follows 
and solve for the function h(u). 


> so0l:=dsolve(numer(mean)=0,h(u)) [1]; 


a + (4 Cl) o(-C2 Cl) 


(4 -CDY? (e-C2-Chy2 Je 
Cl 


1 
sol :=h(u) = 5 


> ans:=simplify(solve(sol,h(u))); 
1 (1 $ 2 -ClU+-C2))) gf--Cl (ut.02) 


ans i= 5) Cl 


We often need special commands to simplify and this is no exception. 


> realans:=combine(convert(ans,trig) ,trig) ; 
cosh(u _C/ + _C2_C1) 
CI 


realans := 


We can pick _C1 and _C2 to see what this function is in one case. Note that _C1 must be nonzero 
since it is in the denominator. 


> func:=subs({_C1=1,_C2=0},realans) ; 


func := cosh(u) 
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So we see that a surface of revolution with H = 0 is a catenoid. 


Exercise 4.9.1. Use the process described above to discover all of the flat (i.e., K = 0) surfaces 
of revolution (see Exercise 3.5.9). The following commands may be useful. 


v 


h:=t->h(t); 

> surfrev:=<ulh(u)*cos(v) |h(u)*sin(v)>; 
> gauss:=GK(surfrev) ; 

> numer (gauss) ; 

> sol:=rhs(dsolve (numer (gauss)=0,h(u))); 
> funct:=subs({_C1=1,_C2=0},sol); 

> func2:=subs({_C1=0,_C2=1},sol); 

> func3:=subs({_C1=1,_C2=1},s01); 


> plot3d([u,funci*cos(v) ,funci*sin(v)] ,u=0..2,v=0..2*Pi, 
scaling=constrained, shading=zhue, lightmodel=light3, 
grid=[5,25] ,orientation=[55,35] ,axes=boxed) ; 


> plot3d([u,func2*cos(v) ,func2*sin(v)] ,u=0..2,v=0..2*Pi, 
scaling=constrained, shading=zhue, lightmodel=light3, 
grid=[5,25] ,orientation=[55,35] ,axes=boxed) ; 


> plot3d([u,func3*cos(v) ,func3*sin(v)] ,u=-1..2,v=0..2*Pi, 
scaling=constrained,shading=zhue,lightmodel=light3, 
grid=[5,25] ,orientation=[55,35] ,axes=boxed) ; 


4.9.4 An Algebraic Condition 


In Exercise 4.2.3, we stated that Scherk’s first surface comes about from placing an algebraic 
condition on a function generating a Monge patch: namely, f(x, y) = g(x) + h(y). Here we 
carry out the process of the exercise using Maple. 


> z2:=(x,y)->g(x) thly); 
z= (x, y) > g(x) + hy) 


The minimal surface equation for this function is 


> minsurfeq:=(1+diff (z(u,v) ,v)°2)*diff (z(u,v) ,u$2) - 
Qediff (z(u,v) ,u) «diff (z(u,v) ,v) «diff (diff (z(u,v) ,u),v) + 
(1+diff (z(u,v) ,u) *2)*diff (z(u,v) ,v$2)=0; 


minsurfeq : = 


d \ / ad d *\ / a 
( + (= hie) (G a) + ( + (+ at) (5 1) =0 
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This is a separable differential equation. We separate the u-parts on the left and the v-parts on 
the right. 


> leftside:=diff(g(u), ‘$‘(u,2))/(1+diff (g(u) ,u)*2); 
rightside:=-diff(h(v) , ‘$‘(v,2))/(i+diff (h(v) ,v)*2); 


a 
lefiside ‘= Bee 
1+ (7; 84) 
@ h 
rightside := av W() 


14+ (Lh) 


The only way that a function of u can always equal a function of v is if both functions are equal 
to the same constant C. We use this to solve each side separately and add the solutions to obtain 


fu, v). 


> dsolve(leftside=C,g(u)); 
In(_C/ sin(C u) — -C2 cos(C u)) 
gu) = Seas = a 

> dsolve(rightside=C,h(v)); 


_ In(—-C/ sin(C v) + -C2 cos(C v)) 


h(v) C 
> gg:=rhs(subs({_C1=0,_C2=-1},dsolve(leftside=C,g(u)))); 
; In(cos(C u)) 
eee 
> hh:=rhs(subs({_C1=0,_C2=1},dsolve(rightside=C,h(v)))); 
his In(cos(C v)) 
Cc 


> combine(ggthh) ; 
—In(cos(C u)) + In(cos(C v)) 
Cc 


We simplify by hand to obtain the function generating Scherk’s first surface. 


( cos(v) ) 
zz:=In 
cos(z) 


4.9.5 Maple and Area Minimization 


Maple may be used to plot minimal surfaces, of course, but there are more uses as well. First, the 
Weierstrass-Enneper representation may be put into a procedure which has as input a holomorphic 
function (in some domain) and as output an isothermal parametrization of a minimal surface. This 
is discussed in great detail in [Opr00] where many examples of minimal surfaces are created, so 
we will not repeat it here. Instead, let’s see how Maple may be used to visualize and calculate in 


> zz:=ln(cos(v)/cos(u)); 
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a specific problem. Namely, for certain boundary curves, we shall show that Enneper’s surface, 
although minimal, is not the spanning surface of /east area. The problem puts forth a candidate 
for smaller area — a generalized cylinder which is not even minimal. Here’s what Maple can do 
for this problem. 


> with(plots): 
> enneper :=<u-u73/3tuxv"2|-v+v73/3-v¥eu"2|u72-v72>; 


1 1 
enneper := . - 5 +uv’, —u+ 3 vy = vu’, yaa “| 


To identify a particular boundary curve easily, we convert Enneper’s representation to polar 
coordinates. 


> ennpolar:=simplify(subs({u=r*cos (theta) ,v=r*sin(theta)}, 
enneper)); 


1 
ennpolar := ae r cos(8) (—3 + 4r? cos(6)* — 3r7), 
l 
ae sin(@) (3 — r* + 4r* cos(6)"), r? (2. cos(6)* — | (4.9.1) 


The following procedure creates a parametrization for the cylinder spanning the boundary 
curve of Enneper’s surface with R = 1.5. In order to graph the cylinder, we need two cases — 
when yrheta is positive and when ytheta is negative. The second coordinate ytheta is negative 
when vu is between 0 and z, so here u must vary from 0 to 2. When v goes from 7 to 27, then u 
must range between —2 and 0. 


> CylEnn := proc(r) 

local xtheta,ytheta,ztheta,n,X; 

xtheta := r*cos(theta)-1/3*r73*cos(3*theta) ; 
ytheta := -r*sin(theta)-1/3*r73*sin(3*theta) ; 
ztheta := r°2*cos(2*theta) ; 

n := abs(ytheta); 

X := [xtheta,ythetatu*n,ztheta] ; 

end: 


We need to plot the two cases individually, so we save the plots by assigning names and 
remembering to use a colon instead of a semi-colon in order to suppress the plot structure 
output. 


> cyli:=plot3d(CylEnn(1.5),u=0..2,theta=0..Pi,scaling= 
constrained, grid=[5,50] ,style=patch): 

> cyl2:=plot3d(CylEnn (1.5) ,u=-2..0,theta=Pi..2*Pi,scaling= 
constrained, grid=[5,50] ,style=patch): 
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Similarly, we save the plot of Enneper’s surface in polar form. 


> enn:=plot3d(ennpolar,r=0..1.5,theta=0..2*Pi,scaling= 
constrained, grid=[5,50] ,style=patch) : 


The following represents the Jordan curve which is the boundary curve for both Enneper’s surface 
with R = 1.5 and the cylinder above. 


> jorcurve:=subs(r=1.51,ennpolar) ; 


jorcurve := [ — 0.5033333333 cos(@) (—9.8403 + 9.1204 cos(#)’), 
— 0.5033333333 sin(@) (0.7199 + 9.1204 cos(#)), 4.5602 cos(@)* — 2.2801] (4.9.2) 


We plot the boundary curve using “tubeplot” to create a small tube around it. This allows it to be 
seen much better than if we use “spacecurve”. 


> bound:=tubeplot (convert (jorcurve, list) ,theta=0..2*Pi, 
radius=0.025,color=black): 


Now we can display both surfaces with the same boundary curve. See Figures 4.11 and 
4.12. 


> display({bound,enn},scaling=constrained, style= 
wireframe, orientation=[154,-106]); 


> display({bound,cyl1,cy12},scaling=constrained, 
orientation=[154,-106]); 


Figure 4.11, Enneper’s surface inside the Figure 4.12 The cylinder inside the Jordan 
Jordan curve curve 
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Now let’s compute areas. The area of the cylinder is found by: 


> ytheta:=subs(r=1.5,ennpolar[2]); 
ytheta := —0.5000000000 sin(@) (0.75 + 9.00 cos(6)”) 


> xi:=diff(subs(r=1.5,ennpolar[1]) ,theta) ; 
z1:=diff (subs (r=1.5,ennpolar[3]) ,theta) ; 


x1 := 0.5000000000 sin(@) (—9.75 + 9.00 cos(6)) + 9.000000000 cos(6)’ sin(8) 
z1 := —9.00 cos(@) sin(@) 


Now we can evaluate the surface area integral of the cylinder (bounded by the boundary Jordan 
curve) numerically. 


> evalf (Int (2*abs(ytheta) *sqrt (x1*2+z172) ,theta=0..Pi)); 
31.66323514 


This is then the surface area of the cylinder inside the boundary curve. On the other hand, 
Enneper’s surface has surface area which may be computed explicitly to be wr7(1 + r? + (r*)/3) 
for radius r. Therefore, for r = 1.5, we have 


> evalf(subs(r=1.5, Pixr72*(1itr72+(r74)/3))); 
34.901 13089 


Hence, Enneper’s surface is minimal, but not area minimizing within the bounds of the Jordan 
curve above. Of course, the surface area of the cylinder could have been (and was) computed 
numerically without Maple. Yet, what recommends computer algebra systems to the mathematical 
and educational community is the convenience, the simplicity of the programming relative to the 
results obtained and the beauty of instant visualization. As final exercises for this section, we 
have 


Exercise 4.9.2. Compute the surface area of Enneper’s surface for radius r to be mr*(1 +r? + 
(r*)/3). Hint: use the surface area integral introduced in Section 4.2. 


Exercise 4.9.3. Compare the areas of cylinders of varying radii with Enneper’s surface. 


4.9.6 Maple and the Weierstrass-Enneper Representation 


In this section, we show how Maple can be used to create minimal surfaces from complex 
functions using the Weierstrass-Enneper Representation. For more information about this, see 
{Opr00]. First, we need to tell Maple what is real and what is complex. 


> assume(u,real) ;additionally(v,real) ;additionally(t,real) ; 
is(u,real); is(v,real); 
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Figure 4.13. The minimal surface for f(z) = < and g(z) = z3: The Bat 


Now, let’s simply write the formulas of Theorem 4.8.6 in Maple and have Maple calculate 
the complex integrals. (The third parameter a in the procedure below just tells Maple how to 
display the final result. As we saw in Section 4.8, sometimes it is convenient to change variables 
by z+ e orz++ e*/* and letting a = 1, 2 does this.) 


> Weierfg := proc(f,g,a) 
local Z1,Z2,X1,X2,X3,Z3,X; 


Zi := int(f*(1-g°2) ,z); 
Z2 := int(I*f*(itg"2),z); 
Z3 := int(2*f*g,z); 

if a=1 then 


Z1:=subs(z=exp(z) ,Z1) ; 

Z2:=subs (z=exp(z) ,Z2); 

Z3:=subs(z=exp(z) ,Z3) fi; 

if a=2 then 

Z1:=subs(z=exp(-I+z/2) ,2Z1); 

Z2:=subs (z=exp(-1+*z/2) ,Z2) ; 
Z3:=subs(z=exp(-I*z/2) ,Z3) fi; 

X1:= simplify(convert (simplify (Re(evalc(subs(z=u+I+v, 
expand(simplify(Z1))))),trig),trig),trig); 

X2:= simplify(convert (simplify (Re(evalc(subs(z=utI*v, 
expand(simplify(Z2))))),trig),trig),trig); 

X3:= simplify(convert (simplify (Re(evalc(subs(z=utl*v, 
expand(simplify(Z3))))),trig) ,trig), trig); 

X := [X1,X2,X3]; 

end: 
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Let’s take one example using the functions f(z) = z and g(z) = 2° witha = 0 (i.e., no special 
change of variables in the final form). 


> Weierfg(z,z73,0); 


] 7 35 z 1 1 1 
[gut gute Batata Tae at = abt ae 


2 
—ulvt+7Tww —Twv tu! —ud, pw 40 P+ Duet] 
To plot the minimal surface above, we simply use the following command. 


> plot3d(Weierfg(z,z73,0) ,u=-1..1,v=-1..1,grid=[40,40], 
shading=xy,style=patch, scaling=constrained, lightmodel=light2, 
orientation=[0,53]); 


And this is The Bat that adorns the cover of the book! 


Geodesics, Metrics and lsometries 


5.1 Introduction 


In axiomatic geometry the axioms focus on the characteristics of the fundamental objects of 
geometry, points and lines. Moreover, in order to test these axioms, certain models of geometry 
are constructed. For example, Riemann’s non-Euclidean geometry is modelled by the sphere with 
lines being the great circles (i.e., circles on the sphere having the center of the sphere as their 
own center). With these definitions, the sphere models a geometry where there do not exist any 
parallels to a given line through a given point. Of course the ramifications of such a result are 
tremendous. In particular, the angle sum of a triangle is greater than 180°. 

If differential geometry is to somehow connect with traditional axiomatic geometry, then it 
must produce its own abstract definition of line (as opposed to the Euclidean axiomatic non- 
definition of it) and show that this works for the typical models of Euclidean and non-Euclidean 
geometry. In particular, if we have the correct notion of line, then we should be able to prove 
(rather than assume or define) that the lines of the sphere are great circles! 

So what should the differential geometric notion of line be? Previously we showed that a 
straight line in the plane gives the shortest distance between two points. We could attempt to 
use this “distance-minimization” criterion as our definition, but it is a hard condition to check. 
Instead, let’s pick out another property of a straight line — a property which is readily calculable 
and which also characterizes the line — the vanishing of its second derivative. 

Now, we cannot take a curve a ona surface M in R? and require a” = 0, since this would just 
give us a straight line in R? and, for an arbitrary M (such as the sphere), there is no reason why 
such a line should remain on M. But, let’s think of all this from the point of view of a resident 
of M. That is, let’s take the viewpoint of a creature who lives on M and has no perception of 
the 3rd dimension given by the unit normal U. Because the creature cannot see anything outside 
of the tangent plane 7,,M, it cannot see the normal component of acceleration. Suppose a has 
unit speed. Then we have two perpendicular unit vectors T = a’ and U (the unit normal of M). 
A third such vector may be obtained by taking U x T. Now, three vectors which are mutually 
perpendicular form a basis of R?, so any vector is a linear combination of them. Hence, we may 
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write a” = AT + B(U x T)+ CU and calculate the coefficients as 
a”-T=A, a" -UxT=B, a” -U=C, 
where we use the fact that 7, U x T and U are unit vectors. Hence, we may write 
a” =(a"-T)T+(a"-UxT)(U x T)+(a”-U)U. 


Further, a’ - a’ = | since a has unit speed, so the product rule for differentiation gives a” - a’ + 
a’- a" = 2a’ -a” = 0. Hence a’ - a” = T - a” = 0 and we have no T7-component for a”, 


a” =(a"-UxT)(UxT)+(a"-U)U. 


The usual identities involving dot product and cross product give U-U x T =O anda” -U x 
T=U-a'xa",soU xT isinT,M forall p € M and 


a” -UxT=U.-a' xa” 
= |U| |a’ x a”| cos 0 
= |a’ x a” | cosd 


= Ky cos@ 


where xq is the curvature of the curve @ and @ is the angle between a’ x a” = T x «N and U. 
The quantity «, cos @ is often called the geodesic curvature of a and is denoted 


Ky = Kg COs. 


Exercise 5.1.1. Show that the following relation holds between the curvature x, ofa, the normal 
curvature k(a’) of a’ and the geodesic curvature k, of a: 


c= ka’ + eg 


Hint: Either (1) simply consider the expression for a” and recall how normal curvature is 
computed, or (2) consider how the normal curvature of a’ related to the curvature of a and note 
that V, B and U are in the plane perpendicular to 7 and that N is perpendicular to B. 


Exercise 5.1.2. For the topmost parallel (i.e., u = 5) on the torus, compute «, «, and k and show 
that the relation above holds in this case. 


Exercise 5.1.3. Let a(s) be a unit speed curve on M with a(0) = 0 € R? for convenience. 
Define the projection of a onto the tangent plane at a(0) by B(s) = a(s) + p(s) Uo, where 
p(s) = —a(s)- Up and where Up is the unit normal of M at a(0). Show that the geodesic 
curvature of a is equal to the ordinary curvature of B; (K,)q = «g. Hints: Use (0) = 0, p’(0) = 0 
and p”(0) =k, the normal curvature at a(0). Show |p’ x B”| = ,. This exercise says that 
geodesic curvature is nothing more than the ordinary curvature that a two-dimensional resident 
of the surface M sees along the surface. 


Exercise 5.1.4. Modify Example 1.3.29 to show that, in the case of a unit speed closed curve 
a of arclength L on a surface M, the initial rate of change of L when q@ is varied in a normal 
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direction is 
dLg(e) 
de 


~ feu x T)- dds 


e=0 
where 6 = a + € & and the variational vector field 5 satisfies a’ -5 = 0 and &-U = 0. The latter 


condition ensures that the curve stays “on” the surface for small €. This result will be used in 
Exercise 7.6.12. 


In Theorem 3.4.1, we saw that Gauss curvature only depends on the metric coefficients E, F 
and G. As we will see in Theorem 5.5.1, this means that Gauss curvature is a so-called “bending 
invariant”: that is, certain types of surface transformations called isometries always preserve 
Gauss curvature. In fact, the analogous result is true as well for geodesic curvature. 


Theorem 5.1.5. The geodesic curvature depends only on the metric. Indeed, taking F = 0 for 
convenience, the geodesic curvature is given by the formula 


E Gy Ex\ , Gy E Gy / 
ky = VEG( ~ a? + (FS - ) urv ge (Se - B)a oor) uly” — u'’). 


Proof. We have k, = a” -U x T as well as 
/ / t 
a’ =x, u' +X, v 
2 2 
of” = XU + Xyy ul +x, u" +Xy,U'v + Xyyu +X," 


Since we assume F = 0, {x,//E, x//G, U} is an orthonormal system. It is then easy to make 
the following calculations (with the help of Formula(s) 3.4.3): 


E G 
uxt = (= x z ) xt snnd= (Eau Ea), 


VE * JG Sao! = ate 
ligne (Bact ANy 
mee a) ag aS 


x, :-UxT=-VEGv’, 
xy-UxT=VEGu'. 


Now, after substituting these computations into kx, = a” - U x T and simplifying, we obtain the 
desired formula. O 


We have seen that we are able to decompose acceleration into tangential and normal 
components 


Qin = KgU xT — and w= (a U)U. 
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In keeping with our viewpoint as a resident of M, we see no acceleration exactly when 
ay, = 0. Therefore we make the following definition. A unit speed curve a in M is a geodesic 
ifay, = 0. 


tan 


Exercise 5.1.6. For a non-unit speed curve a(t) with speed v, show that 


Ww dv 2 7 
a mary fae UxT+(a’-U)U. 


Hints: (1) Recall T = © and differentiate a’ - a’ = v* to geta”-T = ay (2) Use the formula 
a’ 2 


for the curvature of a non-unit speed curve to show thata”- © x U =kgv’. 


The geodesics of a surface will be the lines of our geometry. We first note a very simple 
property of a geodesic (which also follows from the exercise above). 


Lemma 5.1.7. A geodesic has constant speed. 


2 


“ 


Proof. The speed of @ is v={|a’|, so v° =a’-a’. Differentiation yields ave = 
a’ +a’-a”’ =2a’-a” =0 since a” =a" tt 


Pee dv _ : 
‘ormal 2nd ayia’ =0. Hence 4 =0, so v is 
constant. 0 


dt 


Note also that a straight line a(t) = p + tq lying in M must be a geodesic because a” = 0. Of 
course, as we expect, this condition characterizes geodesics in our familiar Euclidean geometry. 
To see this from our definition, suppose P is a plane with normal U and a is a geodesic in P. 


By definition, a//,, = 0,so@” = (@” -U)U.Buta’-U = 0 since lies in P and the product rule 
gives, 


0=(a'-U) =a"-U+a'-U'=a"-U 


since U is constant. Hence, since both the tangential and normal components of a” vanish, we 
have a” = 0. Therefore, a must be a straight line. 

Another class of examples is provided by surfaces of revolution. Let a(u) = (g(u), h(w), 0) be 
a curve with unit speed parametrization. Recall that the associated surface of revolution (about 
the x-axis) has parametrization x(u, v) = (g(u), h(u)cosv, h(u)sin v) with 


E=x,-x,=1, F=0, G=x,-x,=h’? >0. 


Let’s differentiate E and F with respect to u. Recall that this means we take the directional 
derivative of these functions in the x, direction. Since E = | and F = 0, we have 


0= x,[E] 0=x,[F] 
= X,[Xy - Xu] = X,[Xy + Xv] 
= Xyy° Xy + Xy° Xyu = Xyu ° Xy + Xy ° Xyy 
= 2Xy ° Xun = Xyy* Xy 


since X, Xu» = (g’, h’ cos v, h’ sin v) - (0, —h’ sin v, h’ cos v) = 0. Hence, x,, +X, = 0 and Xy,° 
X, = 0, so x,,, is perpendicular to T7,(M). That is, (Xuw)tan = 0. Hence, meridians of surfaces of 
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revolution are geodesics. We shall generalize this statement shortly when we consider Clairaut 
parametrizations (see Proposition 5.2.7). 


Example 5.1.8 (Geodesics on S2). 

Let Se be an R-sphere with parametrization x(u, v) =(Rcosucosv, Rsinucosv, Rsinv). 
Here we see that the parametrization gives a surface of revolution with g(v) = Rsinv and 
h(v) = Ros v, so the v-parameter curves are geodesics. That is, longitudes from the North pole 
to the South pole are geodesics. Since the sphere is symmetric about its center, by an appropriate 
rotation we may consider it as a surface of revolution about any line which passes through 
the center. The v-parameter curves are then circles on the sphere which connect the points of 
intersection of the line with the sphere. Since these points correspond to the North and South 
poles under the rotation, the circles connecting them are great circles on S2. Recall that a great 
circle is a circle of radius R on an R-sphere and, therefore, has center the center of the sphere. 
This discussion shows that all great circles on $2 are geodesics. To see that great circles are the 
only geodesics on $2, suppose a is a unit speed geodesic. Then a, = a” — (a” -U)U = 0 by 
definition, soa” = (a@” - U) U. But, on $2, we know that U(a(t)) = a(t)/R, soa” = pla” -a)a. 
Also then, 


] 
(a’ x UY = Re x ay 


1 
= pin’ xa ta! x a’) 
=0 


since a” is parallel to a and a’ is parallel to itself. Then a’ x U “Wis a constant vector 
with a - AN = 0 since N is perpendicular to U = a/R. But this means that a lies in the plane 
having NV as normal vector. Further, since U = a@/R lies in the plane, from any a(t) we can 
get to the origin (0,0, 0) and stay in the plane. Hence, the plane passes through (0, 0, 0) as 
well. Since @ also lies on S3, it is in the intersection of the sphere with a plane passing through 
the origin. Therefore, it is (a part of) a great circle (see Figure 5.1), The following exercise 
provides another approach, in terms of the Frenet Formulas, to characterizing geodesics on the 
sphere. 


Figure 5.1. Geodesic on a sphere 
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Figure 5.2. Geodesic on a cylinder 


Exercise 5.1.9. Show that a curve a on S2 is a geodesic if and only if it is a great circle. Hints: (1) 
suppose @ is a unit speed geodesic witha’ = T anda” = T’=«N.Show S(T) = kyT — B= 
T/R, where S is the shape operator of S?,. (Note that the first part of the formula for S$ holds in 
general.) (2) What does this imply about xq and Ty? 


Exercise 5.1.10. Show that a geodesic on M, a, which is contained in a plane is also a line of 
curvature. Hint: ty = 0; the binormal B is the plane’s normal; B -U = 0. Another approach to 
this problem is found in Exercise 6.3.7. 


Exercise 5.1.11. Show that a curve a@ in a surface M is both a geodesic and a line of curvature 
in M if and only if @ lies ina plane P which is perpendicular to M (i.e., U is contained in P) 
everywhere along their intersection. Hint: Frenet Formulas; N = U; U’ = aT. Another approach 
to this problem is found in Exercise 6.3.6. 


Exercises 5.1.12. Find the geodesics on the cylinder M: x? + y* = R? (see Figure 5.2). Hints: 
(1) Write a(t) = x(u(t), v(t)) = (Reos u(t), R sin u(t), b v(t)) and take aw”. (2) A unit normal is 
U = (cosu, sinu, 0). Decompose a” = —R(#4)U + a4, find af, and set a//,, = 0. (3) What 
conditions are imposed on de and a9 


Another way to see that our notion of geodesic is correct is to consider the following special 
situation. Suppose M is a surface with a patch (near a given point) x having E = 1, F = 0 and 
G > 0. Un fact, such a patch near a point may always be found. It is called a geodesic polar 
coordinate patch. See Chapter 6.) For instance, a surface of revolution generated by a unit speed 
curve has E = 1, F = OandG = h(uy* > 0. Leta: [so, s;] > M bea unit speed geodesic. Then 


the arclength of a is 
Ss sy 
L(a) =] lads =) lds = s; — So. 


So So 


Now take another curve B: [so, 5;] ~ M with B(so) = a@(so) and B(s;) = a(s;). Note that we 
can always take the same interval [so, 51] by reparametrizing. Also, in an appropriate region, the 
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implicit function theorem allows us to write, B(s) = x(s, g(s)) for some function g of s. We then 
obtain 


B(s) = 


| 
a 
+ 
Led 
e 


lf 
pe 
=. 
+ &| 
ted 
Lj 
9 


and 


Xy ° Xy + 2X, Xygi +Xy- Xyg" 


=/E+Geg” 
= /1+Ge” 


> | 


\B'(s)I 


since G > 0. Hence, we have the following estimate for the length of 8: 


3] SI 5] 
L(p) = / [B'|ds = i, 1+ Gg" ds > / lds =s; — 5s) = L(a). 
SO So So 


Thus, geodesics minimize the distance between two points in (an appropriate neighborhood 
contained in) M. 


Exercise 5.1.13. Take the points (0, 1, 0) and (0, 1, wo) on the cylinder x? + y? = 1 and consider 
both the straight line and helical geodesics joining them. What goes wrong in the argument above 
since it is clear that the helical geodesic is not shortest length. Hint: here is a situation where 
the fact that a surface may not be completely covered by a single patch has a true geometric 
implication. 


5.2 The Geodesic Equations and the Clairaut Relation 


Now let’s try to get a handle on the calculation of geodesics in a more general way. In the following, 
we only consider orthogonal patches x(u, v) (i.e., F = x, -Xy = 0). Let a be a geodesic in the 
patch x. Then a = x(u(r), v(r)) and a’ = x,u’ + x,v’ with 


2 2 
a” = x,,u +X,,0u' +x,u" +x,,u'v' +x,0 +x,v". 


Using the formulas for x,,,, X,» and X,, obtained in Chapter 3, we obtain 


” x ee E, 72 A E, toe G, 12 
ao = Xy |u =U ——uv—- =v 
2E Re OR 


E G G 
+X, [." ~ eet + Gu + a" +U [iu + 2mu'v' + nv” | 
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where the first two terms give the tangential part of a”. For a to be a geodesic then, it is both 
necessary and sufficient that the following geodesic equations are satisfied. 


E, G, : ; 

u" + rear + me ‘yi - oF —*y* =0 (Geodesic Equations) 
E 

vy" aa ue a: é u'v’ + ae vw” =0. 


Exercise 5.2.1. Find the geodesics on the cylinder x? + y*=1 by using the geodesic 
equations. 


Exercise 5.2.2. Suppose that a curve @ satisfies the first geodesic equation and is also constant 
speed (i.e., |a’|? = u’*E + v”G =c). Differentiate the constant speed relation, replace u” by the 
first geodesic equation and show that you obtain the second geodesic equation. Therefore, the 
constant speed relation takes the place of the second geodesic equation in the case of a constant 
speed curve. Now, reverse your calculations to show that the two geodesic equations imply that 
the curve must be constant speed. This is a fact that is hidden by the geodesic equations, but 
which is essential. Hint: in the first part, be very careful with the chain rule. 


The geodesic equations are a system of 2nd-order differential equations. Given initial data 
consisting of a point on M and a tangent vector at the point, the theory of ordinary differential 
equations guarantees the existence and uniqueness of a geodesic on M through the point and 
having velocity vector equal to the given tangent vector. We state this formally as 


Theorem 5.2.3. Let p = x(ug, vo) be a point on a surface M: x(u,v) and let v € T,M. Then 
there is a unique geodesic a: (—r,r) > M with «(0) = p and a'(0) = v. 


Proof. We want to obtain a geodesic a(t) = x(u(t), v(t)) with the property that «’(0) = 

u’(0)x,,(uo, Vo) + v’'(0)xy(uo, Vo) = Vv. Since v is fixed, this gives prescribed values to u’(0) and 
v’(0). Together with the initial values u(0) = ug and v(0) = vo and by the basic existence and 
uniqueness theorems of differential equations, this is precisely enough information to determine 
a unique solution (in some interval about 0) to the geodesic equations. O 


Exercise 5.2.4. The plane and the sphere have the property that all geodesics are plane curves. 
Show that this property characterizes the plane and the sphere. That is, show that if M is a surface 
such that every geodesic is a plane curve, then M is a part of a plane or a sphere. Hint: Use 
Exercise 5.1.10, Theorem 5.2.3 and Theorem 3.5.2. 


Example 5.2.5 (Geodesic Equations on the Unit Sphere S*). 
Take the standard patch 


x(u, v) = (cosucosv, sinucosv, sinv) 
and calculate E = cos* v, F = 0 and G = 1. The geodesic equations become, 


; 2 
u” —2tanvu’v' = 0, v” +sinvcosvu’ = 0. 
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As it stands, this is a formidable system of nonlinear differential equations. In fact, it is hardly 
ever the case that the geodesic equations are solved directly. The sphere provides a situation 
where we can use a standard trick to determine geodesics. First, assume without loss of generality 
that a(t) = x(u(t), v(t)) is a unit speed geodesic. Then, besides having the geodesic equations, 
we also have a’ = u’x, + v’x,, which leads to the unit speed relation ] = Eu” + Gv". On the 
unit sphere this is simply 1 = cos? v u? +". Solve the first geodesic equation as follows: 


ul” 
[oe frm 
u 


Inu’ = —2Incosu+C 


c 
ul = —>— where c = e°. 
cos? v 
Now replace u’ in the unit speed relation by —c, to get 
2 
c 2 
1 = —— cos’ v + 
cos4 v 
2 
72 a c 
cos? v 


Dividing u’ by v’ produces the separable differential equation 


du +e 


dv cosvvcos? v — c? 


which may be integrated (by making the substitutions w = c//1 — c? tanv and w = sin@ in 
steps 4 and 5 below respectively and taking a + sign for convenience) to produce 


c 
a a 
cos vv/cos? v — c2 


csec? v 


= | ———- dv 
v1 —c* sec? v 


csec? v 


ay eee cn 
V1 —c? —c? tan? v 

=| dw 

Swe 

= [a 


= ares ( ail )+a 
Vi —c? 


Therefore, we have 


sin(u — d) = 2 tanv 
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where A = c/+/1 — c*. We may expand sin(u — d) = sinu cosd — sind cos u and find acommon 
denominator cos v to get 
sin u COS v cosucosv A sinv 


cos d — sind ————— — = 
COS U cos v COS U 


If we make the substitutions x = cosucosv, y = sinucosv, z = sinv and only consider the 
numerator, we get 


ycosd —x sind-Az=0. 


Hence, the geodesic equations imply that a lies ona plane ax + by + cz = 0 through the origin. 
Just as in our previous discussion of the sphere, this means that @ is contained in the intersection 
of a plane through the origin with the sphere — and this is a great circle. 

Before we leave the sphere, let’s look at one other piece of information provided by the geodesic 
equations. Let ¢ denote the (smaller) angle between a’ and x, at any point along the unit speed 
curve a. Then (using the angle + — @ between a’ and x, and the identity sing = cos(> — @)) 
we get 


f Xu 


a’. ] 
sing = ——* = —— [(u'x, + v’x,)-x,] =u'VE =u' cosy. 
|a"| [Xu ve! 0%] 
The angle between a’ and x, is actually > + ¢, so the left-most term of the equation should be 
+ sin @. However, by reversing the direction of a ifnecessary, we can assume @ < 2/2, in which 
case the displayed equation is correct. Now, u’ = (c)/(cos’ v) from the geodesic equations, so 
we obtain (by changing the sign of c if necessary) 


c 


sing = : 
COS U 

This is a special case of what is known as Clairaut’s relation (see Exercise 5.2.12). We shall 
discuss this in general shortly. For the moment, let’s look at what the relation implies about 
the behavior of geodesics (i.e., great circles) on the sphere. Suppose a geodesic @ starts out 
parallel to a latitude circle (i.e., a w-parameter curve) x(u, v9). This means that a’(0) is parallel 
to x,,(uo, Vo), so the angle @ is 4. The Clairaut relation says that 1 = sin | = (c)/(cos vo), so 
c = cos vp. Along a we then have cos v sing = cos vg and, since | sing] < 1, cos? v > cos? up. 
Because = <u< ei this means cos v > cos Ug. We then have the cases 


v 


1A 


Vo for0 < v, v9 < 5 


v > U9 for = = uve. 0: 


Since v represents latitude, these inequalities mean that the great circle a is pinched between the 
latitudes vp above and vo below the equator. 


Example 5.2.6 (Geodesic Equations on the Torus). 
Take the torus M with parametrization 


x(u,v) =((R+rcosu)cosv, (R+rcosu)sinv, rsinu). 


von WY suet ies 


OWS 1 
SON 


Catalan’s surface 


A perturbed Boy’s surface 


Enneper’s surface 


A helicoid 


Henneberg’s surface 


Scherk’s fifth surface 
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We know E = r*, F = 0 and G =(R+rcosu)’, so the geodesic equations are 


2 rsinu 
uv’ =0 v” — 2—_——__u''v' = 0. 


»  (R+rcosyz) . 
ue + ———— sin “v= 
r (R+prcosz) 


The second equation is separable and we have vu’ = c/(R +r cosu)*. Again assuming a is unit 
speed, we replace v’ in the unit speed relation by c/(R + r cosu)* to get 


2 
2.92 c 


l=r‘u ———____{ 
(R+rcosuy 


| ce 
u = —_} | —- —————_.. 
r (R+rcosu) 


Now we divide v’ by u’ to obtain 


dv _ cr 
du (R+rcosu)/(R +rcosuy — c 


v =} ee du. 
(R+rcosu)/(R +rcosu)? — c? 


Unfortunately, unlike the sphere, we cannot integrate the right-hand side explicitly. Therefore, 
the Clairaut relation takes on extra importance since it helps us to visualize the path of a 
geodesic. For instance, the Clairaut relation for the torus is (R +rcosu)sing =c, where ¢ 
is the angle between a’ and x,. Suppose aq starts out parallel to the topmost parallel circle 
x(>,v) =(Reosv, Rsinv, r). Then d = 5 and the Clairaut relation gives R = c. Again since 
|sing| < 1, wehave R+rcosu > Rallalong a. This implies that cos u > 0 and, consequently, 
—3 <u < 3. Hence, the geodesic is confined to the outside of the torus and, in fact, bounces 
between the topmost and the bottommost parallels (see Figure 5.3). 


What do these two examples have in common? For both the sphere and the torus, the patches 
are orthogonal and E and G depend only on u or only on v. We say that an orthogonal patch 
X(u, v) is a Clairaut parametrization in u if E, =0 and G, = 0. The patch is Clairaut in v if 
E, = 0 and G, = 0. Of course, as we have seen, the sphere is Clairaut in v and the torus is 


Figure 5.3. Geodesic on a torus 


220 5. Geodesics, Metrics and Isometries 


Clairaut in u. The geodesic equations simplify in these cases to 


ue + apt _ os =0 (u-Clairaut Geodesic Equations) 
G 
" wore 0 
vo + 6 u 
HW E, FF ms . . 
ue + ze v =0 (v-Clairaut Geodesic Equations) 
E G 
" vi 72 v 72 
-— —v’ =0 
2G « 2G 


In the following we shall focus on u-Clairaut parametrizations, but everything we say also 
applies to the v-Clairaut case as well. The geodesic equations give immediate information about 
some curves in the case of a u-Clairaut parametrization. 


Proposition 5.2.7. If M is a surface with u-Clairaut parametrization x(u,v), then the u- 
parameter curves are geodesics (when reparametrized to have constant speed). 


Proof. Take a u-parameter curve x(u, vg) for the u-Clairaut patch x and write it as a(t) = x(t, v9). 
Note that u(t) = t, v(t) = vp with u’ = 1, v’ = 0. Since the geodesic equations hold within them 
the constant speed relation, we must reparametrize a to have constant speed. Let’s do this 
explicitly since we will need the precise relationship below. The arclength for @ is 


s(t) = i, la'(y)Idy = i, JEW) dy 


_ as . : 
since a’(t) = x, -u' = x,. Hence, we obtain ra J E(t) > 0, so we have an inverse function 
t = ¢(s) and we define the unit speed reparametrization of a@ to be B(s) = a(t(s)). Of course, this 
means that 


B(s) = x(t(s), vo) 


with u(s) = f(s) and v(s) = vo. We then have 


Gi aa ges Tl 
ds dtds JE VE 
du 1 1 1 E, 


= E =~ 2 
ds? 2 B32" SE 2E? 
Now, the second u-Clairaut geodesic equation automatically holds because v’ = v” = 0. When 
we substitute the two u-derivatives above (where u’ now denotes the derivative with respect to 
S), the first geodesic equation gives 
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Therefore, the u-Clairaut geodesic equations are satisfied and the reparametrized u-parameter 
curve A(s) is a geodesic. O 


Hence, for a u-Clairaut patch, u-parameter curves are geodesics. Of course this is a general- 
ization of our previous discussion of surfaces of revolution. Also, note that we were required 
to reparametrize the u-parameter curve to have unit speed. While every curve can be reparame- 
trized to have unit speed, it is not often the case that u-parameter curves naturally come with a 
unit speed parametrization. Some authors give the name pre-geodesic to curves which are not 
constant speed, but when reparametrized to have constant speed, become geodesics. We shall 
simply continue to use the term geodesic for these curves. 

Since u-parameter curves of u-Clairaut patches are geodesics, it is natural to ask whether 
the same is true for v-parameter curves x(uo, v) of u-Clairaut patches. The geodesic equations 
become 


Gy 12 = Gy, 


Seo mae 


where v’ = 1 and G, is evaluated at uo. So, it is clear that the v-parameter curve x(uo, v) is a 
geodesic if and only if G,,(uo) = 0. In particular, this applies to surfaces of revolution where 
G(u) = h(u)’. Putting this together with the discussion above gives 


Theorem 5.2.8. Let M: x(u, v) be a surface with u-Clairaut patch x. Then every u-parameter 
curve is a geodesic and a v-parameter curve with u = Uo is a geodesic precisely when G,,(ug) = 0. 


Corollary 5.2.9. For a surface of revolution having parametrization x(u, v) = (g(u), 
h(u) cos v, h(u)sinv), any meridian is a geodesic and a parallel is a geodesic precisely when 
h'(ug) = 0. 


Exercise 5.2.10. Give an example ofa function A(z) so that the surface of revolution parametrized 
by x(u, v) = (u, h(u) cos v, h(u)sin v) has a parallel circle at u = ug which is a geodesic, but uo 
is not a maximum or minimum for A(z). 


In general, for a u-Clairaut parametrization x(u, v) and a unit speed geodesic a, we can reduce 
the second geodesic equation quite easily to first order. This follows just as for the examples of 
the sphere and torus above. The equation v” + Se y'y! = 0 becomes 
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Again, as in the examples, v’ in the unit speed relation may be replaced by c/G to give 
1 = Eu? + Gv” 


2 
(jn ee 


G2 
2 
12 c 
1l=E = 
u +S 
me Ge 
EG 
wot oe 
EG 


Exercise 5.2.11. Differentiate either of the last two equations and show that you obtain the 
first geodesic equation. Therefore, again, the unit speed relation takes the place of this geodesic 
equation in the case of a Clairaut parametrization. 


Now if we divide v’ by u’, we obtain a single integral which serves to characterize geodesics 
for a u-Clairaut parametrization. 


dv _ 
dues’ 
tps Gh 
+ [G2 
tceJE 
” VGVG-2 


cVE 
ae ara 


Exercises 5.2.12. Suppose a(t) = x(u(t), v(t)) is a unit speed geodesic and x is u-Clairaut. Show 
that the Clairaut relation /G sind = c holds, where c is a constant and ¢ is the angle from x, to 
a’. Hence, show that a cannot leave the region of the surface for which G > c?. 


Exercises 5.2.13. Show that geodesics in the plane are straight lines. Hint: Use polar coordinates 
x(u, v) = (ucosv, usinv). 


Exercises 5.2.14. Let M: x(u, v) = (ucosv, u sin v, au) denote a cone. Show that a unit speed 
geodesic a(t) = x(u(r), v(t)) on the cone is characterized by the equation u = c sec(v/V/1 + a? 
+ D). For a= 1, determine c and D for the geodesic connecting (1,0, 1) and (0, 1, 1) and 
compare the arclength of this geodesic between these points to the arclength of the parallel circle 
joining the points (see Figure 5.4). Hint: see Subsection 5.6.2. 


Exercise 5.2.15. Figure 5.5 shows a geodesic on the surface of revolution obtained from the 
Witch of Agnesi. The geodesic begins paralle! to the parallel circle having u = 2/3. Explain the 
behavior of the geodesic in terms of the Clairaut relation. 
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Figure 5.5. Geodesic on the whirling witch of Agnesi 


Before we go on, let’s look at the Clairaut relation from a physical viewpoint. Consider 
a surface M and suppose a particle is constrained to move on M, but is otherwise free of 
external forces. D’Alembert’s principle in mechanics (see [Am78]) states that the constraint 
force F is normal to the surface, so Newton’s Law becomes |F| U = ma”, where |F| de- 
notes the magnitude of F, U is M’s unit normal and a(t) is the motion curve of the particle. 
Taking m = 1, Newton’s Law tells us that the x, and x, components of acceleration vanish 
and, as we have seen, this leads to the geodesic equations. Hence, a freely moving particle 
constrained to move on a surface moves along geodesics. Now suppose M is a surface of rev- 
olution parametrized by x(u, v) = (h(u)cosv, h({u)sinv, g(u)). Along the surface, the radial 
vector r from the origin is simply r = (A(u)cosv, h(u)sinv, g(u)) and the momentum vector 
of the particle’s trajectory is given by p = a’ = u’x, + v’x, (since m = 1). Note that, since a 
is a geodesic, we may assume its speed is constant. (This also may be inferred from conser- 
vation of energy.) The angular momentum of the particle about the origin may be calculated 
to be 


L=rxp 
= ((g'h — gh')u' sinv — ghv' cosv, —(g’h — gh’)u' cosv — ghv' sinv, h’v’). 


Because no external forces (or, more appropriately, torques) act on the particle (also see [Lun91]), 
we have 


Proposition 5.2.16. The z-component of the angular momentum is conserved. 
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Proof. Let’s write the z-component of L as L, = k-(a@ x a’) (where we identify a = r and 
a’ = p. We then compute the derivative to be: 
dL, 
—=k-(a' xa’)+k-(ax a”) 
dt 
= |a"|k-(@ x U) 


because @ is a geodesic. If we write a = x(u, v) = (h(u)cos v, A(u)sinv, g(u)) and 


Xu XXy _ (—g’hcosv, —g’hsinv, hh’) 


|Xu X Xy| IX, X Xy| 


> 


then we easily calculate the z-coordinate of a x U to be zero. Thus, we have k-(@ x U) = 0 
and, therefore, dL. /dt = 0 as well. Hence, L, is constant. im 


Therefore, we see that h*v’ = C, where C is a constant. Further, if ¢ is the angle from x,, to 
a’, then 


la'|hsing =a’ -x, = (u'x, + 'x,)-x, =h?v' =C 
since cos(> — @) = sing andx, - x, = h?. Because |a’| is constant as well, we have 
h sing = constant. 


But this is precisely the Clairaut relation since h = JG. In this way the Clairaut relation is a 
physical phenomenon as well as a mathematical one. (Also see Exercise 7.6.16 for a derivation 
of the Clairaut relation using Hamilton’s principle.) 


Exercise 5.2.17. Let M denote the elliptic paraboloid z = x* + y* parametrized as a surface of 
revolution by x(u, v) = (ucos v. usin v, uw?) and let a = x(u(t), v(t)) be a unit speed geodesic. 
Find v in terms of u and show that the Clairaut relation is u sing = c. Show that a non-meridian 
geodesic spirals up the paraboloid and crosses every meridian an infinite number of times (see 
Figure 5.6). Hints: (1) use the Clairaut relation (2) the integral for v diverges, so a can’t approach 
a meridian as a limit. 


Figure 5.6. Geodesic on a paraboloid 
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Exercise 5.2.18. For the paraboloid M and geodesic a above, determine how low on M the 
geodesic a can go in terms of the constant c in the Clairaut relation. What happens to the geodesic 
when it reaches its lowest height? From this, show that a non-meridian geodesic intersects itself 
an infinite number of times. Hints: You may use the following result which we will not prove. 
Theorem ({dC76, section 4-7, p. 302]): If a geodesic approaches a parallel as a limit, then the 
parallel is a geodesic. What parallels are geodesics on the paraboloid? 


Exercise 5.2.19. Determine v as an integral for a geodesic on the catenoid. Show that the Clairaut 
relation cosh u sing = c determines the behavior of geodesics approaching the parallel circle with 
ug = 0. Hint: take a geodesic starting at x(uo, v9) with @ = ¢p and consider the following three 
cases: cosh ug sin gg = c < 1, coshug sings = c = 1 and coshugsingyp = c > I. 


Exercise 5.2.20. Determine v as an integral for a geodesic on the hyperboloid of one 
sheet x? + y?—z*?=1 and give the Clairaut relation. Show directly that the ruling of 
the hyperboloid which passes through (1,0,0) satisfies the Clairaut relation. Find a 
closed geodesic on the hyperboloid. Hints: (1) the ruling may be parametrized by a(v) = 
(1, tan v, tanv) (why?) and a parallel circle may be parametrized by 6(v) = (coshupcosv, 
cosh uo Sin v, sinha). What happens on the central parallel (cos v, sinv, 0)? (2) A geodesic 
a: [a,b] > M is closed if a(a) = a(b) and a’(a) = a’(b). When are parallels geodesics? 


5.3 A Brief Digression on Completeness 


In the discussion above, at various points, we have implicitly assumed that geodesics “run 
forever”. That is, we have taken for granted that a geodesic is a unit speed curve a: R- M 
whose domain is the real numbers. That this is not always the case is apparent from looking at an 
example such as the plane minus the origin, M = R? — {(0, 0)}. We know that geodesics in the 
plane are straight lines, so the same must be true for M since the same geodesic equations hold. 
Suppose a geodesic starts at (r, s) in the direction (—r, —s). We know by Theorem 5.2.3 that there 
is a unique geodesic of this type which heads toward (0, 0). Since the geodesic is unit speed, its 
arclength (i.e., distance travelled) corresponds to the length of the interval (a, b] on which it is 
defined. Because the geodesic cannot pass through the missing origin, it can only go a distance 
of less than /r2 + s?. Therefore, the interval [a, b] cannot be all of R. What goes wrong here? 

In order to answer this question, let’s make a definition. Say that a surface M is geodesically 
complete if every (unit speed) geodesic has domain R. The importance of this definition is 
reflected in the following beautiful result which we will not prove. 


Theorem 5.3.1 (Hopf-Rinow). /f M is geodesically complete, then any two points of M may be 
Joined by a geodesic which has the shortest length of any curve between the two points. 


When we speak of being a resident of some surface M, we naturally assume that we can get from 
any point to any other by some shortest-distance path. If we live in a non-geodesically complete 
surface however, this is just not so. In M = R* — {(0, 0)}, we cannot go from (—1, 0) to (1, 0), 
say, by a geodesic. Indeed, there is no shortest path between these points. So, the question now 
becomes, how do we recognize geodesically complete surfaces? The example of the plane minus 
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the origin gives a clue. Recall that a subset M C R? is closed if any convergent sequence z; > z 
with z; € M also has z € M. A sufficient condition for a surface to be geodesically complete is 
given by 


Theorem 5.3.2. 4 closed surface M C R? is geodesically complete. 


Proof. Let a be a unit speed geodesic on M. If a(s) is defined, then the fact that @ arises as a 
solution to the geodesic equations guarantees that a is defined on an open interval (s — 7,5 +7), 
for some r. This means that every s € R for which @ is defined has a small interval about it of 
other points for which a is defined. Hence, the subset of R consisting of points for which a is 
defined is an open set — that is, a union of open intervals. Let 7 = (a, b) be one of these intervals 
where we assume b < o0 and BD is a least upper bound for 7. We shall show that a is, in fact, 
defined for b as well. Hence, / has no least upper bound (i.e., b = 00). A similar argument for a 
then shows that a is defined on the whole real line. 

Let z; — b be a sequence of points in / converging to the least upper bound b. Since the 
sequence converges, it is also Cauchy. That is, given « > 0, there exists an integer N such that 
for all n, m > N, |Z — Zm| < €. Now choose € > 0 and look at a(z,,) and a(z,,). Since q@ is unit 
speed, its arclength from a(z,,) to @(Z,) is simply |Z, — Zm| < €. Also, as we saw in Chapter 1, 
the (shortest) distance between two points p, qg € R? is given by the line joining them and is 
\q — p|. Hence we have, 


|a(Zn) = a(Zm)| < 124 = Zin <€. 


This means that the sequence {a(z;)} is Cauchy as well. Now, it is a fact of analysis that Euclidean 
space R” for any n is complete in the sense that every Cauchy sequence converges, so we have 
a(z;) > w for some w € R?. But a(z;) € M for all j and M is closed by hypothesis, so w € M. 
Hence, we may define a(b) = w, showing that a may be defined at b as well and contradicting 
b’s definition. We note that we have only shown that a may be extended continuously to b. Some 
rather technical arguments indeed show that a extends as a smooth geodesic. O 


Exercise 5.3.3. Which of the surfaces we have studied up to now are geodesically complete? 


5.4 Surfaces not in R? 


Up to this point we have taken surfaces in 3-space together with the inner product structure 
naturally associated to any Euclidean space, the dot product. But there are other inner products 
which are available. Indeed, any non-singular symmetric n x n-matrix A gives an inner product 
on R" by taking the matrix multiplication 


(x,y) =x'Ay 


for x, y € R". Here x’ denotes the transpose of the column vector x to a row vector. The dot 
product is simply the inner product which takes A to be the identity matrix. In some sense, 
the geometry which we see on surfaces in 3-space reflects the geometry of 3-space itself. If we 
wish to extend our notion of geometry beyond 3-space we must somehow rid ourselves of our 
dependence on the dot product — or, at least, modify this dependence. Recall the definition of the 
dot product in R?. (We use vectors in R? because tangent planes of surfaces are 2-dimensional.) 
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Let x = (x1, x2), y = (v1, y2) € R? and define 


1 0 
xy = (41, x2) & 4 i = x1 y + X22. 


Now, change the matrix to [ ue Al for a > 0 and find 


l/a 0 XY + xy x: 
roy = (ni) "f Wal (21) = Ate = 


Tae 


where “-” is the dot product and “‘o” is the modified dot product. All the usual properties of the dot 
product (such as symmetry, bilinearity etc.) work for the modified version as well. For an inner 
product on the tangent planes of a surface M the number a may change at each point p € M. 
That is, we may take a to be a function on M, 


a=f(p) forf:M—>RandpeM. 


We take the square of the function f(p) to ensure that a is positive at each point. Thus, for two 
tangent vectors v, w € T,(M) we have, 


= Vv: W 
 f(pyP’ 


We say that - and o are metrics of M since they allow us to calculate E, F, and G, the basic 
components of distance along M. The dot product is usually called the Euclidean Metric and a 
metric such as 0 is said to be conformal (to the Euclidean metric) with scaling factor f. 

Now, if these surfaces do not use the Euclidean metric, then where are they? While they may 
sit in R? as sets, their different metrics show that they don’t inherit geometry from R?. So, as 
surfaces with a metric, they do not sit inside R*! But if we cannot use the structure of R? — 
for example, the unit normal — how can we understand the geometry of such a surface? As is 
typical in mathematics, when we extend beyond our usual situation, we use previous theorems as 
definitions. Because we no longer can use the unit normal U to define Gauss curvature K, instead 
we invoke Theorem 3.4.1 to define K. Note that this only makes sense because Theorem 3.4.1 
ensures that K depends only on the metric. Therefore, 


vow 


Definition 5.4.1 (Important Definition). For a surface with orthogonal metric (i.e., F = 0), the 
Gauss curvature K is defined to be 


case (Be) +b (Ge) 

2VEG \ av EG du E , 
Of course, because this formula was a theorem for surfaces in R°, this definition agrees with our 
original definition in case M has a metric induced by the Euclidean metric of R?. 


Exercise 5.4.2. For a conformal metric with scaling factor f, show that 


Kafe twa ee 
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A function f is harmonic if fuu + foy = 0. In this case, clearly, K < Oand K = 0 only at critical 
points of f. Define a conformal metric on R? — {(0, 0)} by taking f = 5 In(u? + v?). Compute 
K and draw the curves in the plane where K is constant. 


As the following examples show, we can still define surfaces by taking patches x in R?, but 
with a conformal metric rather than the induced metric. We shall see that we can do geometry in 
this situation just as before. Understanding this abstract differential geometry is the first step to 
understanding the geometry of higher dimensions. 


Example 5.4.3 (The Poincaré Plane P). 

Define P to be the upper half-plane P = {(x, y) € R? | y > 0} with the patch x(u, v) = (u, v) 

and with the conformal metric, 

wi WwW 
y2 


Wi oW2 = where wi, W2 € 7,(P) and p = (u,v). 


This definition of the metric means that the usual dot product is scaled down by the height of p 
(1.e., v) above the x-axis. Let’s compute E, F and G (at p). We obtain 


x, = (1,0) Xy = (0, 1) 


1 


E=x,0x%,=-~ F=x,0x, =0 G=xX,oxX, =. 
v 


y2 


2 
Note that G, = 0 and E, = aor Hence 
Vv 


Thus, P has constant curvature equal to —1 at each point. P is then a “negative curvature” 
analogue of the unit sphere in R?. 


Exercise 5.4.4. Define M = {(x, y) € R* | y > 0} to be the upper half-plane with the patch 
x(u, v) = (u, v) and with the conformal metric, 


WwW; : W2 


WwW) OW? = where wi, W2 € 7,(P) and p = (u,v). 


Calculate the Gauss curvature K of M. 


Example 5.4.5 (The Hyperbolic Plane H). 
Define H to be the disk of radius 2 in the plane minus the bounding circle with patch x(u, v) = 
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(ucosv,usinv) 0<u <2, 0<v < 2m and conformal metric, 
w, Ww 


2 
W, ow = (—u2/4p _ u2/4y 


where w;, W2 € 7,(H) and p = (ucosv, usin v). 


Exercises 5.4.6. Show that K = —1 at each point. 


Example 5.4.7 (The Stereographic Sphere S%,). 

Let S%, denote the unit sphere minus the North pole S* — {(0, 0, 1)} and define a map St: S?, > R? 
(where R? is the xy-plane) by taking the point in R? which is the intersection of R? and the line 
in RR? determined by a point on the sphere and the North pole N. Formally, given a point 
Pp = (coSucos v, Sinu cos v, sinv) on Lye the line joining p and N is given by 


y(t) = (0, 0, 1) +t (cosu cos v, sinu cos v, sinv — 1). 


The line y intersects R* when the third coordinate is zero. This occurs when | + t(sinv — 1) = 0 
ort = 1/(1 —sinv). Hence, 


St(cosucosv, sinucosv, sinv) = ( 


cosucosv sinucosv 
l-—sinv’ l—sinv’ 


Clearly, St is a one-to-one onto map from Sj, to R’. Recall that the induced linear transformation 
of tangent vectors St, may be calculated by simply taking appropriate derivatives of the image u 
and v curves. For example, fixing a particular v and differentiating gives 

d (= cosv sinucosv ) 


St.(x,) = — 


du \ 1—sinv’ 1—sinv’ 


—sinucosv cosucosu 
l—sinv ’ 1—sinv’ 
Similarly, differentiating with respect to v gives 
COS u sin u 
St,(x,) = {| ———-_, ——_,, 0 
l—sinv 1—sinv 
We may now define a new metric on S3, by saying 
W1 0 W2 = St.(W1) - St.(W2) 


where - denotes the sphere’s induced R? metric. 


Exercise 5.4.8. In the metric o on Si,, show 


__ costy oh 1 
~ (1—sinv?’ =o 


and calculate the Gauss curvature K. Explain your answer. 


E 


Example 5.4.9 (The Flat Torus 7yat). 

Although we have spoken of “surfaces not in R*”, we have so far used only examples where 
the underlying point sets are in R*. But it is possible to define point sets in Euclidean spaces 
of dimension different from 3 and to give these point sets a metric derived from the Euclidean 
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space. A famous instance of this is the flat torus in R*. Let Tax be defined by the patch x(u, v) = 
(cosu, sinu, cosv, sinv) with 0 <u <2, 0<v < 2m as domain, range IR*, and with the 
induced metric (i.e., dot product) of R*. Just as before, we may calculate x,, and x, componentwise 
to gt E=1!, F =O and G = 1. Hence, K = 0 and 7g, is flat. We emphasize again that this 
example does not come from a conformal metric, but rather from a Euclidean metric of a higher 
dimension. 


Exercise 5.4.10. Verify the calculations for the flat torus Ta, and then show that the name is apt 
by defining a one-to-one onto map from 7g, to a usual torus. Since the flat torus is closed and 
bounded in R*, it is compact. Can the flat torus be embedded as a surface in R? (i.e., Tage C R? 
with the induced metric of R?)? Explain. 


Beyond having a definition of Gauss curvature, however, we would like to understand the 
nature of these surfaces in terms of their geodesics. Again we are faced with the problem of 
removing all traces of the unit normal U from our previous discussions of geodesics. One way is 
easy. When we calculated the Christoffel symbols for x,,,, X,, and X,,, we never used U. Hence, 
the same formulas hold for surfaces not in R?. In particular, the calculation of a” remains the 
same, but without the final term involving U. That is, 


Ey 72 cE E, to G, 12 
u uv Vv 
2E E 2E 


a” =x, " + 


For @ to be a geodesic then, just as before, it is both necessary and sufficient that the geodesic 
equations are satisfied. Now let’s look at the original way we defined geodesics in terms of 
geodesic curvature. Recall that, for a unit speed curve a, we had 


a” =K,U xT +(a"-U)U. 


Of course, we do not have a U now, so we must drop off the last term and somehow make 
sense of the first. The tangent vector U x T was used before to obtain an orthonormal basis for 
R?,{T, U x T, U), where T and U x T provided an orthonormal basis for the tangent plane at 
every point of the surface. Although we do not have a unit normal to work with for surfaces not in 
R?, we may still find an appropriate tangent vector in each tangent plane which is perpendicular 
to T. The way to do this is as follows. Given a patch x(x, v), define a linear transformation J of 
each tangent plane by 


1(4)=+ 1(3:)= Xy 
VE) JG VG VE 
Note that defining J on a basis for each tangent plane suffices to define J completely. Also note 


that J is defined on unit vectors and has unit vectors as outputs. This is to ensure that J has no 
effect on lengths, but, rather, only rotates vectors on each tangent plane. 


Exercise 5.4.11. Suppose a: / > M is acurve ona surface M in R? (i.e., with R®’s induced 
metric). For convenience, take a to have unit speed. Recall that patches are now assumed 
to be orthogonal; that is, F = 0. Show that J/(7) = U x T. Hint: ao’ = u’x, + v’x, and J is 
linear. 
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Since J(T) = U x T (by Exercise 5.4.11) for surfaces in R°, we can take the following equation 
as the definition of geodesic curvature for a unit speed curve a(t). Of course the non-unit speed 
case is just as before, but without the U term. Now, 
@ Ke FL) 
J(T) = J(u'x, + v'x,) 
= ul J (x,) + v' I (Xp) 


TE. 6 
=u cr EM 
a” Kev’ a +kpu' ae 
=~ fp kut ket’ | =X. 
8 E g G 


If we equate the x, coefficient of this expression for a” and the one we used to define geodesic 
equations, we have 


? G ” Ey 12 E, roe G, 2 
Kev ye =u + 5p¥ eee Yoke 


ao ER Bee Fe ol. AG 
G=——_/Az- oS + ui+ v. 
vu YG 2VEG v VEG 2VEG 
We may write all this in a much more meaningful way. Let @ denote the angle between a’ and 
X,. Since @ is unit speed, we have 


so 


a’ =u'x, +x, 


Xu “ Xp 
= cos@ —— + sin@ ——. 


VE VG 


Therefore, u’/E = cos@ and v'/G = sin@. If we differentiate the first expression and substitute 
in the second, we obtain 


j ,E,u'+E,v'  dcosé 
pee 


2VE dt 
” E, ie” E, u'v' R dé 
Uu VE + + = — sin? — 
QE: . “O67E: dt 
E, 2 E. uv’ de 
Ee eG yh 
IE. IGE dt 


—u" [E  Eyu” —Eyu' 
vu VG wVEG 2WEG dt 
Comparing this with the expression for «, above gives the following 


Theorem 5.4.12. Let a(t) be a unit speed curve in a surface M: x(u, v) and let @ denote the 
angle between a’ and x,. Then the geodesic curvature of a is given by 

dé 
a dt 


K 


1 
G,v — E,u'|. 
+ ee! : “ 
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Note that we have not specified a range for the patch x(u, v). This is simply to indicate that 
the expression for x, holds for surfaces not in R? as well as those in R?. We shall meet up 
with geodesic curvature again in Chapter 6 where we shall derive the above formula for x, ina 
different way. 

Now let’s find some geodesics. Our previous approach in terms of the geodesic equations still 
works here of course. In particular, the notions of u and v-Clairaut still have meaning and still 
simplify the geodesic equations greatly. Note, however, that the property of being a Clairaut 
patch is much more dependent on the definition of the metric now than was the case when we 
could safely rely on R*’s dot product. 


Example 5.4.13 (Geodesics on the Poincaré Plane P). 
The patch x(u, v) = (u, v) with the conformal metric, 


Ww, - W2 
v2 


WL, OW = where wy, W2 € 7,(P) and p = (u,v) 
is v-Clairaut, so, for a = x(u(t), u(t)), the geodesic equations give 


2 1 
u” — —y'v' =0, vp’ + -u — -v" =0. 
v v 


Since the patch is v-Clairaut, the usual procedure for calculating geodesics gives, from the first 
geodesic equation, either u’ = 0 or 


Tid 2 , 
ea” 
u v 
In(u’) = 2In(v) +c 
uo=cv’. 


Plugging this into the unit speed relation 1 =u’(I/v2)+v’*(I/v2) then gives v= 
+v/1 — c*v?. Dividing v’ by u’ and integrating produces 


[us| 7S 


Le 
u-d=~ f sinwdw where v = — sinw 
c Cc 


II 
o| Ll is} 
— 
| 
y 
to 
vod 
is) 


e(u-dfv =1—cv 
I 
(u-dvtv=n. 
c 
This is the equation of a circle centered on the u-axis. Also, since the patch is v-Clairaut, vertical 
lines (i.e., v-parameter curves) are geodesics as well. This is the u’ = 0 case. So, the Poincaré 


plane P has as its geodesics arcs of circles centered on the u-axis and vertical lines (see Figure 5.7). 
These are the “straight lines” of P. 
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Figure 5.7. Geodesic on the Poincaré plane 


Exercise 5.4.14. Show that for the Poincaré plane, the parallel postulate does not hold. The 
parallel postulate says that, given a line and a point not on the line, there is precisely one line 
through the point which is parallel to the given line. The Poincaré plane is a model for the 
non-Euclidean geometry of Gauss, Lobachevsky and Bolyai. 


Exercise 5.4.15. Find the geodesics for the surface M of Exercise 5.4.4. 


Example 5.4.16 (Geodesics on the Hyperbolic Plane H). 

We compute geodesics as follows: (1) Since x is u-Clairaut, u-parameter curves are geodesics. 
Hence, radial lines through (0, 0) are geodesics. For non-u-parameter geodesics, apply the Clairaut 
integral formula (after simplification) to get 


a area. c(1 — u?/4) 


2 (& cia) 


This appears formidable, but it can be handled as follows. 
Exercise 5.4.17. Let w = c/uvl + c? (1 +-u*/4) and show that we have dv = Fdw/J/1 — w?. 


Then f dv = f dw/V1 — w? and v — vp = cos™! w. We then write cos(v — vp) = w or 


cos(v — v9) = (1 +u7/4) 


c 
uJ/1 +c? 


4uV/1 +c? 
Bist Misty creams Gee NET 


4uJ/1 +c? 


ue + 4 — —~——_ cos(v — v9) = 0. 
c 


To interpret this equation, replace polar coordinates (u, v) by rectangular coordinates (x, y), x = 
ucosv and y = wsinv, expand cos(v — vg) to cos vcos vp + sin v sin vg and complete squares. 
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We have, 
2 4 l+c 4/1+c? R . 
ué + 4 — ————_ c08 0p u cos v — ————— Sin ug usinv = 0 
c c 
2 2 4J/14+c? 4/14 c? . 
x+y +4 — ——— cos ) x — ———— sinuy y = 0 
c c 
2 2 
2714 ¢? 2V Pe? 4(1 +c?) 
2S Se a 00s 164 de Sg fe ee 
c € c 


2 2 

( wWI+e2 ( 2WJIte | 4 
xX — —— cosvo}] + | y — ———sinvof = eae 

c c c 

This equation represents a circle centered outside H (since 2/1 +c?/c > 2). Moreover, this 

circle intersects the boundary circle of H, x? + y? = 4, at right angles. Therefore, the geodesics 

of the hyperbolic plane are radial lines through the origin and arcs of circles centered outside 

radius 2 which meet H orthogonally. 


Exercise 5.4.18. Show that the circle above meets x* + y* = 4 orthogonally. Hint: 
2 2 
(QE5| -2+(2) . 
c c 


Note, again, that, given a geodesic and a point off it, there are an infinite number of parallel 
geodesics through the point. The hyperbolic plane is also a model for the Gauss-Lobachevsky- 
Bolyai non-Euclidean geometry. Indeed, the hyperbolic plane is isometric to the Poincaré plane 
That is, from the viewpoint of differential geometry they are indistinguishable. We shall conside! 
this more extensively shortly. 


Exercise 5.4.19. Find the geodesics on the stereographic sphere S;,. Hints: (1) the patch is 
v-Clairaut, (2) a convenient substitution may be cos v/(1 — sinv) = c sec@, (3) just as for the 
ordinary sphere, geodesics lie on certain kinds of planes — planes which always pass through a 
particular point of the sphere. 


Exercise 5.4.20. Let x(r, 9) = (r cos@, rsin@) be the polar coordinate patch for the xy-plane. 
Show that the inverse to stereographic projection is given by 
2rcos@ 2rsin@ r*—1 

ltr?’ [4r2? 14Pr2)° 


St '(r cos 9, rsin@) = ( 


Define a metric on the plane by taking w; o w2 = St, !(w;) - Sty !(w2) where - stands for the usua: 
metric on the sphere. This surface is called the Stereographic Plane. Show that 


= 4 F=0 G= 4r? 
~ +r?y? te ~ (1+r?) 


and K = +1. Find the geodesics of the stereographic plane by imitating the approach for the 
hyperbolic plane. In particular, the substitution w = c(1 — r?)/2r-/1 — c? may be useful. The 
unit circle and the lines through the origin are geodesics. Why? Finally, show that all geodesics 
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intersect the unit circle, the angle of intersection B obeys cos B = c, where c is the Clairaut 
constant and the formula for a geodesic becomes 

(x + tan B cos)? +(y + tanf sin@)* = sec* B 


(and therefore gives a circle). Now see Subsection 5.6.5. 


5.5 Isometries and Conformal Maps 


Suppose we have two surfaces M; x(u, uv) and N: y(r, s) given by the indicated patches, If there 
are smooth functions r and s from the domain of x to the domain of y, then we may define a map 
of surfaces ]: M — N by I(x(u. v)) = y(r(u, v), s(u, v)). We say that / is a (local) isometry if 


Ey = Xu % My = 1.(%u) Py I,(Xu)s x =X Oy Xy = 1(Xy) By 1,(Xy). 


Here, as usual, we assume that orthogonal patches are given, so we implicitly assume that 
0 = Fy = 1,(x,) %y /,(X,). By the chain rule, we know that /,(X.) = Yp fu + Yy Sy and 1,(%)) = 
¥-To + Ys Su, SO we get a very explicit check on whether or nat a mapping is an isometry. Namely, 
we must have 


2 2 2 2 
Ey = X, Oy X, = Eyr, + Gys;,, Gy = Xp Oy Xy = Eyr, + Gysy. 


Of course this is what we obtained in Exercise 3.2.4 for the special case of two parametrizations of 
the same surface (with / the identity mapping). Moreover, the very same calculations (restricted 
to our situation where F = 0) give ExGy = [ruSy — rvSu]° EyGy, so that plugging into the formula 
which defines K (and much tedious computation) produces the fundamental 


Theorem 5.5.1, /f7: M — N isan isometry, then the Gauss curvatures al corresponding points 
are equal, That ts, Ky(p) = Ky(1(p)) forall p = M. 


Exercise 5,5,2. In Exercise 3.4,2 it was shown that the surfaces (see Figure 5.8) 
x(u, Uv) = (ucosu, WSiN vv), y(r, 8) =(rcoss,rsins, Inr) 


have the same formula for Gauss curvature, K = —1/(1 + w?)? where w stands for uw and r 
respectively. Show that these surfaces cannot be isometric however. Hint: an isometry would 
require r(u. v) = u (up to sign), What goes wrong with E, = Eyr? + Gys2? 


Figure 5.8, Non-isometric surfaces with the same K 
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Remark 5.5.3. Ifthe parameter domains are the same and the functions r and s are the projections 
r(u,v) = u and s(u, v) = v, then an isometry simply means that the metric is preserved 


E,=Ey Kh=Fy. G,=Gy. (5.5.1) 


In fact, many authors say that the functions r(u,v) and s(u,v) give a new parametrization 
y(u, v) = y(r(u, v), s(u, v)) of the surface N, so that a mapping may be defined by /(x(u, v)) = 
y(u. v) and the question of whether J is an isometry is reduced to the simpler criterion above. 
Of course, the particular mapping must be checked to be a parametrization. In order to avoid 
this and to make the examples and exercises below more transparent, we have chosen to make 
the functions r and s explicit. But, in fact, the case above is the important one to know! As 
long as the mapping 1: M — N is locally one-to-one (i.e., small neighborhoods can be chosen 
about each point of M where J is one-to-one) and / satisfies (5.5.1), then J will provide new 
parametrizations for parts of N which are the image of a neighborhood on which J is one-to-one. 


Clearly, an isometry preserves the lengths of all tangent vectors (since each is a linear combina- 
tion of x, and x,.). Indeed, the word “isometry” comes from the Greek for “same measurement”. 
Furthermore, the formula (where - denotes any inner product) 


cos(@) = Emad 
Iv] iwi 
for the angle 6 between two vectors v and w makes it clear that isometries preserve the angles 
between vectors as well. 

The Inverse Function Theorem then says that each point of M has a small open neighborhood 
around it so that J has a smooth inverse function on that neighborhood. In the case where | 
is one-to-one, onto.and has a smooth global inverse /~!: N — M, we say that J is a global 
isometry. Although our definition of isometry is not the most general one which can be given, it 
will suffice for all the examples in this book. Let’s examine some of these now. 


Example 5.5.4 (Helicoid Isometry). 

Let two surfaces M and N be defined by patches x(u, v) = (ucos v, usin v, v) and y(r,s) = 
(sinhr coss, sinhr sins, s) respectively. The first patch we recognize as the standard helicoid. 
Define a map J: M > N by I(x(u, v)) = y(sinh”! u, v). Now, r = sinh”! u and s = v, so the 
relevant nonzero partials are r, = 1/coshr = 1//1+u2 ands, = 1. Then we calculate FE, = 
1 = (1/cosh*r) cosh? r = Eyr? and G, = | +. u? = cosh*r = Gys?. Hence, I is an isometry. 
Clearly, / is one-to-one and onto with smooth inverse /~!(y(r, s)) = x(sinhr, s), so J is in fact a 
global isometry. 


Exercise 5.5.5. Let z=a,/x*+y? be the cone with the parametrization x(u,v) = 
(ucosv,usinu,au). Let @ denote the vertex angle between the z-axis and the cone. Show 
that sing = 1/1 + a7. If we cut the cone along a ruling line, then we can unroll it onto the 
plane to get a pie wedge (see Figure 5.9). Let the vertex angle of the pie wedge be 6 and show 
that 6 = 27 sing. The unrolling process is given by y‘(u, v) = 


(uta cos( ). ul + ta? sin( ) anv) 


v v 
Vv14 ta? V1+4+ ta? 
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Figure 5.9. Unrolling a cone 


for 0 < t < 1. Check that this map gives the unrolling above. Show that the map defined by fixing 
any tf, /(ucosv,usinv,an) = y‘(u, v), is an isometry. Finally, show explicitly that / takes 
geodesics on the cone to geodesics (i.e., lines) in the plane. Hint: for the last part, a substitution 
w = AB/(A — B tan(v/V/1 + a2)) may be useful, where A = c/ sin D, B = ¢/ cos D and c and 
D come from the formula for geodesics on the cone. Also see Exercise 5.6.6. 


Exercise 5.5.6. Invert a cone with vertex angle @ so that it sits on the xy-plane. Throw a lasso 
around the cone (see Figure 5.10) with a loop of fixed length L and pull down on the free end of 
the lasso to tighten the loop around the cone. What happens? More specifically, does the lasso 
slip over the top or does it stay fixed? Does your answer depend on the vertex angle? Explain. 
Hints: (1) what kind of curve does the loop become? If the loop stabilizes, what are the only 
forces acting on it? Are there tangential forces? If not, then the acceleration of the curve is in 
what direction? (2) Now unroll the cone, splitting it along the line through the knot of the lasso. 
Where does the curve go? What if @ is too big? 


Exercise 5.5.7. For the cylinder x(u, v) = (u, cos v, | + sin) which sits on the x-axis, find an 
unrolling map (which cuts the cylinder along the top) and show that, for each fixed r, the resulting 
map is an isometry. Further, show that the geodesics on the cylinder go to straight lines in the 
plane under the final isometry /. Hint: consider right and left sides separately; roll the right half of 
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Figure 5.10. Lassoing a cone 


the cylinder to the right and when a point hits the x-axis, let it remain there; show that a formula 
for this procedure is given by the following. 


(u, t+sin(v—t), 1—cos(v—1)) fort <v 
Ttu,v,th= 

({u, v, 0) fort >v 
Now write a formula for the left side and then look at Subsection 5.6.3 for a Maple approach to 
this unrolling of the cylinder. 


The fact that the unrolling maps J for the cone and cylinder carry geodesics to geodesics 
is not a phenomenon special to these surfaces. In fact, by Theorem 5.1.5, we see that, since 
geodesic curvature depends only on the metric, isometries must therefore preserve it and hence, 
take geodesics to geodesics. Another proof of this is given in the following 


Theorem 5.5.8. Let a(t) be a geodesic ona surface M and let: M — N bean isometry. Then, 
the curve B(t) = I(a(t)) is a geodesic of N. 


Proof. In case the functions r and s above are the projections, the proof is trivial. On the other 
hand, if r = r(u, v) and s = s(u, v) are general smooth functions, then the calculations are quite 
tedious. We shall take a middle road and prove the theorem for the case r = r(u) and s = s(v). 
This will give the flavor of the calculations involved, but avoid unnecessary complications. With 
this in mind, take /(x(u, v)) = y(r(u), s(v)) and [(a(t)) = [(x(u(t), v(t))) = y(r(u(t)), s(v(2))). 
We wish to show that r(u(t)) and s(v(t)) satisfy the geodesic equations in y’s metric. Our 
assumption on r and s implies the following relationships between the metrics for x and y, 
BSE 


ue 


Gy=Gys,  Ey= Eu? Gy = Gyv? 


Also, the chain rule gives 
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Now we put r and s into the first geodesic expression and substitute the calculations above 
to get 


r’ Be Fy, 2 oe Bys aes = Gy, 2 a Ext +2Eyu,uy, r”? Ex,v,u? TAs = Gy, Ve, 12 
2Ey Ey 2Ey 2Eyu2 E,u2 2E,u2 
ry an upr? + Ap? 4 es usr's’ — Gay vis” 
aE uy Ey.” 2Ey u, 
" 12 
7 ue 72 " Xv no Gry v 
=rt+ Se ie te ree ae Ty = 
x r Ey 2E, u, 
l " Ey 72 E, he G, 12 
=— lu + rou +—uv - pv | =0 
ui, 2E EY OR 


since @ satisfies M’s geodesic equations. (Note that factoring out 1/u, gave r,u, = 1 in the 
numerators of the middle terms.) Therefore, 6 satisfies the geodesic equations for N. O 


Exercise 5.5.9. Show that r and s satisfy the second geodesic equation also. 


Exercise 5.5.10. Use the standard polar coordinate patch x(u, v) = (ucosv,usinv) for the 
hyperbolic plane H and define a map ]: H — P by 


4u sinv 4-1 


LOGE) aN eg. ay © aaa 
( (u v)) Gees. u2+4ucosv+4 


0) = y(r(u, v), s(u, v)) 

where P is the Poincaré plane parametrized by y(r, s) = (7, s). Show that J is an isometry. The 
map / is, in fact, a one-to-one onto map (with smooth inverse) to the upper half plane, so it is a 
global isometry. The formula above results from the Mobius (or linear fractional) transformation 
of complex variable theory, 


z—2 
T(z) = -i—., 
©) z+2 
where z = x + iy. As we saw in Chapter 4, complex analysis plays a large role in differential 
geometry. Here is a sample of the complex approach in terms of something we already know. 
Consider the hyperbolic plane H with standard polar coordinate u-Clairaut parametrization 
x(u, v) = (ucos v, u sin v). Since radial lines through the origin are u-parameter curves, they are 
geodesics. Now, given any point p = (uo, vo) in the disk u = 2, there exists a linear fractional 
transformation of the form 

_ azt+2e 


fk (a een ag 2_tcP=1 
(z) e+e ja\" — |e| 


which, in real plane coordinates and with the hyperbolic metric, is an isometry of H and which 
carries the origin to p. Geodesics through p are then images of radial lines through the ori- 
gin under 7. But it is known that linear fractional transformations take circles and lines to 
circles and lines. Also, since T is an isometry, and since radial lines through the origin meet 
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u = 2 perpendicularly, so must the geodesics through p. Hence, geodesics through p must 
be circles which meet the boundary circle u = 2 at right angles. This is what we calculated 
earlier. 


Exercise 5.5.11. Suppose we parametrize the helicoid by x(u, v) = (sinhu cos v, sinhu sin v, v). 
Smoothly bend the helicoid into the catenoid by defining (from t = > tot = 7) 


x'(u, v) = (sint sinhu cos v — cost coshu sin v, 
sint sinhu sinv + cost coshu cos v, 


ucost + vsint). 


Only at the last step is the bending not one-to-one. Show that, for every 5 <1 < 7, I(x(u, v)) = 
x'(u, v) is a global isometry and for t = 7, J is a local isometry. Also show that at each stage 
Gauss curvature is preserved (since it is determined by the metric) and each intermediate surface 
is minimal. 


Notice that stereographic projection from the ordinary (i.e., not the stereographic) sphere 
5?\{N} to R? is not an isometry. Recall that, on tangent vectors, we have 
8 


St,(x,) —sinucosv cosucosv 
x = ary] reine oe ng fo a 
ry 1 —sinv l—sinv’ 
cosu sinu 
St.(x,) = { ————_-, -_——_. 
Pye 1—sinv’ 1—sinv 


Hence, taking dot products in R?, we get St,(x,,) - St.(x,) = 0 as well as 


cos? v 1 


SESE) = Gc ama Game 


1 l 
St, vy)? St, v= = —= % v* Ap: 
(%») (%») (l—sinv)? (1 —sin vy 7m 


While the factor 1/(1 — sin v) indicates that stretching is taking place and, therefore, St canno! 
be an isometry, it also shows that the stretching is uniform and angles are preserved. Taking « 
stereographic projection as our basic example, let us weaken the isometry condition by requiring 
a mapping /: M — N to satisfy the following proportionality condition. For orthogonal patches 
x and y, say that /(x(u, v)) = y(r(u, v), s(u, v)) is a conformal map if 0 = 1,(X,) oy L(x) 
and 


Ey = Mu, v) [Eyre + Gysz], Gy = A(u, v)? [Eyr? + Gys?] 
for a function A(u, v) called the scaling factor. Again note that, when r(u, v) = wand s(u, v) = 2, 
T is conformal exactly when the metrics are proportional at each point p € M and its image 


(pen, 


Ey = A(u, vy Ey, Gy = A(u, vy Ey. 
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Exercise 5.5.12. Show that if J is conformal, then angles between tangent vectors are preserved. 
Hint: it is sufficient to show this for x, and x,. 


Note that stereographic projection is a conformal mapping from the sphere to the plane with 
the metrics induced by R?. This fact is essential for the complex analytic approach to minimal 
surfaces in Chapter 4. 


Exercise 5.5.13. Let x(u, v) = (Rcosucosv, R sinucos v, R sin v) denote the usual coordinate 
patch for the sphere of radius R. Define 


I(x(u, v)) = (u, Intan (5 Be 7) ,0) 


= y(r(u, v), s(U, v)) 


where y(r, 5) = (r, 5,9) is a patch for a strip in the coordinate plane. Show that J is conformal 
with scaling factor A(u, v) = Rcosv. 

Suppose a@ = x(u, v(u)) is a curve on the sphere which intersects every longitude (i.e., a v- 
parameter curve) at a constant angle. Show that v(u) = Cu/R? and show that such a curve has 
a straight line as its /-image. The map / is called the Mercator projection of a sphere onto a 
rectangle. In particular, the Mercator projection may be used to construct a map of the Earth. 
The curve property we have mentioned allowed sailors in earlier times to plot courses on a map 
as straight lines and then steer the plotted courses by keeping constant compass headings (i.e., 
making a constant angle with longitudes). Although these courses were not geodesics, the ease 
in navigation made up for the greater distance. For a nice discussion of the mathematics of map 
making see [RW84]. 


5.6 Geodesics and Maple 


5.6.1 Plotting Geodesics 


It has always been hard for geometers to visualize geodesics. This is due to the fact that solutions 
of the geodesic equations must generally be obtained numerically and then plugged into the 
surface parametrization. In years past, this process would require a great deal of complicated 
programming to mix the numerical analysis with 3D-graphics. Today however, a short and simple 
Maple procedure can accomplish the same task. Geodesics may be graphed on their surfaces and 
may give just the right kind of intuitive information to allow rigorous analysis. For example, the 
picture Figure 5.6 provides the intuitive insight necessary to solving Exercise 5.2.17. Of course, 
it is a simple matter for Maple to generate the geodesic equations for a given parametrization. 
The following procedures do exactly this. First, we need the metric (where, as usual, we take 
F = 0). 


> with(plots):with(LinearAlgebra) : 

> EFG := proc(X) 

local Xu,Xv,E,F,G; 

Xu := <diff(X[1],u) ,diff(X[2] ,u) ,diff(X[3],u)>; 
Xv := <diff(X(1],v) ,diff(X[2] ,v) ,diff(X[3],v)>; 
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E := DotProduct (Xu, Xu, conjugate=false) ; 
F := DotProduct (Xu, Xv,conjugate=false) ; 
G := DotProduct (Xv,Xv,conjugate=false) ; 


simplify([E,F,G]); 
end: 


The geodesic equations are given by 


> geoeq:=proc(X) 

local M,eqi,eq2; 

M:=EFG(X) ; 

eqi:=diff (u(t) ,t$2)+subs ({u=u(t) ,v=v(t)}, 

diff (M[1] ,u)/(2*M[1] )) *diff (u(t) ,t) “2+subs ({u=u(t) ,v=v(t)}, 
diff (M[1] ,v)/(M[1]))*diff (u(t) ,t)*diff (v(t) ,t)- subs({u=u(t), 
vev(t)}, diff (M[3] ,u) /(2*M[1] )) *diff (v(t) ,t)°2=0; 

eq2:=diff (v(t) ,t$2)-subs ({u=u(t) ,v=v(t) }, 

diff(M[1] ,v)/(2*M(3]))*diff (u(t) ,t)*2+subs ({u=u(t) ,v=v(t) }, 
diff (M[3] ,u)/(M[3])) *diff (u(t) ,t)*diff(v(t) ,t)+ subs({u=u(t), 
vev(t)}, diff (M[3] ,v) /(2*M[3] ))¥*diff (v(t) ,t)72=0; 

eqi,eq2; 

end: 


Exercise 5.6.1. Show that the following is a procedure for the geodesic equations when F + 0. 


> geoeq2:=proc(X) 

local M,G,E,F,D,Eu,Ev,Fu,Fv,Gu,Gv,eqi,eq2; 

M:=EFG(X); 

E:=M(1]; F:=M(2]; G:=M(3]; 

D:=E*G-F#F; 

Eu:=diff(M[1],u); Ev:=diff(M[1] ,v); 

Fu:=diff(M[2] ,u); Fv:=diff(M[2] ,v); 

Gu:=diff(M[3] ,u); Gv:=diff(M[3] ,v); 
eqi:=diff (u(t) ,t$2) + subs( {u=u(t),v=v(t)} , 
(G*Eu-F*(2*Fu-Ev)) /(2*D) )*diff (u(t) ,t)°2 

+ subs({u=u(t) ,v=v(t)}, (GeEv-F*Gu) / D)*diff(u(t),t)* 
diff(v(t),t) + subs( {u=u(t),v=v(t)} , 

(Gx (2*Fv-Gu) -F*Gv) / (2*D) ) *diff (v(t) ,t)*2=0; 
eq2:=diff(v(t) ,t$2) + subs( {u=u(t),v=v(t)}, 

(Ex (2*Fu-Ev)-F*Eu) /(2*D))*diff(u(t),t)°2 + subs({u=u(t), 
vev(t)}, (ExGu-F*Ev)/(D))*diff (u(t) ,t)*diff(v(t) ,t) 

+ subs ({u=u(t) ,v=v(t)}, (ExGv-F*(2*Fv-Gu) ) /(2*D) ) * 
diff (v(t) ,t)*2=0; 

eqi,eq2; 

end: 


Finally, we come to the centerpiece of this section, a Maple procedure which plots geodesics 
on surfaces. This procedure has a bewildering number of inputs, but each serves a function. 
The first input is, of course, the parametrization of the surface in question. The next four inputs 
give bounds for the parameters u and v of the surface. The inputs uO and v0 give an initial 
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point on the surface for the geodesic while Du0 and Dv0 give an initial direction. The input 
T gives a bound on the geodesic parameter t while N gives the number of points plotted on 
the geodesic. A larger N makes the geodesic appear smoother. The input “gr” is a vector which 
allows the grid size to be changed from its usual 25 by 25. Again, the larger the grid, the better 
the picture and the slower the rendering. Finally, the inputs “theta” and “phi” orient the output 


picture. 


> plotgeo:=proc(X,ustart,uend,vstart,vend, u0,v0,Du0,Dv0, 
T,N,gr,theta, phi) 

local sys,desys,ul,vi,listp,geo,plotx; 

sys :=geoeq(X) ; 
desys:=dsolve({sys,u(0)=u0,v(0)=v0,D(u) (0)=Du0,D(v) (0)=Dvo}, 
{u(t) ,v(t)},type=numeric, output=listprocedure) ; 
ul:=subs(desys,u(t)); vi:=subs(desys,v(t)); 

geo:=tubeplot (convert (subs (u=’ul? (t) ,v=’v1’ (t) ,X) , list), 
t=0. .T,radius=0.05,color=black ,numpoints=N): 
plotX:=plot3d(X,u=ustart..uend,v=vstart..vend,grid=[gr[1], 
gr[2]],shading=XY, lightmodel=light3): 

display ({geo,plotX},shading=XY, lightmodel=light2, 
scaling=constrained, orientation=[theta, phi] ); 

end: 


Here are two example surfaces. Plot some geodesics on them following the models for the 
torus below. 


> sphere:=<cos(u)*cos(v) |sin(u)*cos(v)|sin(v)>; 
sphere := [cos(u) cos(v), sin(u) cos(v), sin(v)] 
> torus:=<(4+cos(u))*cos(v) | (4+cos(u))*sin(v) |sin(u)>; 
torus := [(4 + cos(u)) cos(v), (4 + cos(u)) sin(v), sin(u)] 
> EFG(sphere) ; 
{cos(v)’, 0, 1] 


> geoeq(sphere) ; 


, 


G 0) ee 


dt? cos(v(t)) 


a? d : 
(F v(t)) + cox(v)} sin(v(t)) (= «) =0 


> EFG(torus); 
[1, 0, (4+ cos(u))’] 


Explain the following behavior (Figure 5.11) of a geodesic. Where and in what direction does 
the geodesic begin? Where does it always stay? Why? (Is the Clairaut relation involved?) 
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Figure 5.11. Geodesic bouncing between top 
and bottom parallels of torus of torus 


Figure 5.13. Geodesic on the inside parallel Figure 5.14 Geodesic on the inside parallel of 
of torus torus: an example of round-off error 


> plotgeo(torus,0,2*Pi,0,2*Pi,Pi/2,0,0,1,30, 150, [15,30], 
105,33); 

> plotgeo(torus,0,2*Pi,0,2*Pi,0,0,0,1,15, 75,[15,30], 
177,68); 

> plotgeo(torus,0,2*Pi,0,2#Pi,Pi,0,0,1,15, 165, [15,30], 
177,68); 


The following example, depicted in Figure 5.14, shows that round-off error plays a part in any 
computer calculation of geodesics. The initial data starts the geodesic along the inside parallel, 
but round-off error builds up as we go from tf = 0 to r = 35 (as opposed to t = 15 above), so the 
curve departs from its true trajectory (compare with Figure 5.13) 


> plotgeo(torus,0,2*Pi,0,2*Pi,Pi,0,0,1,35, 165, [15,30], 
177,68) ; 


Exercise 5.6.2. Consider a geodesic on the torus above (1.e., with R = 4 and r = |) that begins 
parallel to the top parallel circle at an angle p to x,,. The Clairaut relation gives R sin(do) = 
c, so we always have (R +r cos())sin(@) = R sin(@o). Show that we can choose the angle 
oo = arcsin((R — r)/R) so that the geodesic is asymptotic to the inner geodesic parallel circle at 
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u = 7. Now plot the geodesic with Maple (and be aware of round-off error). Hints: (1) for the 
first part, let « — a and @ — 7/2; (2) for the second part, choose Du0 and Dv0 appropriate 
for @y. Be careful! The angle @y is in the tangent plane, but this vector has been distorted by the 
parametrization from the vector determined by DuO and Dv0 in the wv-plane. Use the metric 
coefficients to take account of the distortion and find the correct direction in the wv-plane. 


Figure 5.15. Geodesic on the Witch 


Exercise 5.6.3. For the Whirling Witch of Agnesi of Exercise 5.2.15 and Figure 5.5, plot the 
geodesic referred to by the following. 


witch:=<2*tan(u) |2*cos(u)~2*cos(v) |2*cos(u) "2*sin(v)>; 
witch ;= [2 tan(u), 2cos(u)’ cos(v), 2cos(u)’ sin(v)] 
> plotgeo(witch,-1.1,1.1,0,2*Pi,Pi/3,0,0,1, 100,200, [15,30], 
92,-113); 


Can you see the effect of the Clairaut relation? 


Exercise 5.6.4. Plot geodesics for various initial directions and starting points on the sphere, 
cylinder, cone and paraboloid using the standard parametrizations for these surfaces. Relate your 
pictures to examples in the book. In particular, relate the cone and paraboloid to Exercise 5.2.14 
and Exercise 5,2.17 respectively. 


Exercise 5.6.5. Plot geodesics for various initial directions and starting points on the catenoid 
and on the hyperboloid of one sheet, Here are some further examples to help (see Figure 5.16, Fig- 
ure 5.17 and Figure 5.18). We consider the hyperboloid of one sheet with its usual parametrization 
and its ruled surface parametrization. 


hyperb:=<cosh(u) *cos(v) |cosh(u)+*sin(v) |sinh(u)>; 
hyperbruled:=<cos(u)-v+sin(u) |sin(u)+v*cos(u) |v>; 
hyperb := [cosh(u) cos(v), cosh(w) sin(v), sinh(w)] 
hyperbruled := [cos(u) — v sin(u), sin() + v cos(u), Vv] 
> plotgeo(hyperb,-2,2,0,2*Pi,0,0,0,1,10,25, [10,30] ,47,85); 
plotgeo(hyperbruled,0,2*Pi,-4,4,0,0,1,0,10,35, [30,10], 
47,85) ; 


246 


5. Geodesics, Metrics and Isometries 


Figure 5.18, Geodesic on a hyperboloid of I-sheet 
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> plotgeo(hyperb,-2,2,0,2*Pi,0,0,1,1,3.5,25, [10,30] ,14,85); 
plotgeo(hyperbruled,0,2*Pi,-4,4,0,0,0,1,4,25, [30,10] ,14, 85); 


> plotgeo(hyperb,-2,2,0,2*Pi,0.05,0,0,2,3,100, [10,30], 
45,45); 
plotgeo(hyperbruled,0,2*Pi,-4,4,0,0.5,1,0,5,25, [30,10], 
45,45); 


5.6.2 Geodesics on the Cone 


Let’s look at a specific example of a geodesic on a cone z = \/x? + y?. We use the formula for 
cone geodesics from Exercise 5.2.14: 


v 
u = C sec { ———— + D) 
(J 1+a2 
and consider a geodesic from (1, 0, 1) to (0, 1, 1) on the cone. This means that u = | at beginning 
and end with v = 0 at the start and v = 7/2 at the finish. Plugging into the general geodesic 
formula, we obtain 


1 
1=C sec(D), 1=C sec{ —— + D}. 
se ("5 ) 


These can be solved to give C = cos(D) and 
cos( 55 — 1) 
D = arctan eek es 
7 is 
sin( 55) 


> dd:=evalf(arctan((cos(Pi/(2*sqrt(2)))-1)/ 
sin(Pi/(2*sqrt(2))))); 

dd := —0.5553603670 
> cc:=cos(dd); 

cc := 0.8497104921 


So now we can plot the geodesic by replacing u in the parametrization of the cone by the 
formula above with the C and D just determined. See Figure 5.19. 


> geo:=tubeplot ([cc*sec (v/sqrt (2)+dd) *cos(v) , 
cc*sec(v/sqrt (2)+dd) *sin(v) ,cc*sec(v/sqrt(2)+dd)], 
v=0..Pi/2,radius=0.03,color=black): 

> cone:=plot3d([u*cos(v) ,u*sin(v) ,u] ,u=0..1.8,v=0..2*Pi, 
grid=[6,24] ,shading=xy, lightmodel=light2) : 

> circ:=tubeplot([cos(v) ,sin(v) ,1] ,v=0..Pi/2,radius=0.025, 
color=blue): 

> display({geo,cone,circ},scaling=constrained, orientation= 
[40 ,84]); 
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Figure 5,19. Geodesic and parallel circle joining (1, 0, 1) to (0, I, 1) 


In Figure 5.19 the geodesic and the parallel circle can be seen joining the given points. We 
can now compute the arclength of the geodesic from (1, 0, 1) to (0, 1, 1) and compare it with the 
arclength of the circle parametrized by (cos(v), sin(v), 1) joining the two points. Of course, the 
circle has radius |, so the arclength along it from (1, 0, 1) to(0, 1, 1)is simply 7/2 * 1.570796327 
since the angle formed by joining the center of the circle (0, 0. 1) to the points 1s 2/2. Here is the 
geodesic again. 


> alpha:=<cc/cos(v/sqrt(2)+dd)*cos(v) | cc/cos(v/sqrt (2) +dd) * 
sin(v) |cc/cos(v/sqrt (2)+dd)>; 


0.8497104921 cos(v) 0.8497104921 sin(v) 


a = 


2 )/2 
cos( *Y? 4, 0.5553603670 } cos(*? =: 0.5553603670 


0.849710492) 
5 
c0s( : ue m3 0.5553603670 


We take its velocity vector and then compute the arclength integral. (Note that the “map” 
command does “diff” to each coordinate of a.) 


> alphap:=map(diff,alpha,v): 


evalf (Int (sqrt ((alphap[1] ~2+alphap[2] ~2+alphap[3]~2)), 
v=0..Pi/2)); 


1.491286907 
So the geodesic has shorter length than the circle. 
Exercise 5.6.6. For Exercise 5.5.5. plot a geodesic with the same initial conditions on the 


unrolling cone for ¢ = 0, = 0.25, r = 0.5, t = 0.75 and 1 = 1. This illustrates that geodesics 
on the cone go to straight lines in the plane under the isometric unrolling as they must by 
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Theorem 5.5.8. The following commands give an animation showing a cone geodesic being 
taken to a straight line in the plane under the unrolling isometry. 


> Yt:=(u,v,t,a)-><u*sqrt (1+t*a*2) *cos(v/sqrt (itt*a*2)) | 
uxsqrt (1+t*a*2) *sin(v/sqrt (i+t*a*2) ) |a*tu*sqrt (1-t) >: 

> display(seq(plotgeo(Yt(u,v,i/20,1.5) ,0,3,0,2*Pi,3,0, -1, 
0.25,5,25, [5,20] ,163,67) ,i=0..20) ,scaling=constrained, 
insequence=true) ; 


5.6.3 Geodesics on the Cylinder 


In Exercise 5.5.7 an unrolling map for the cylinder was given. Here we would like to use it to 
illustrate Theorem 5.5.8. Specifically, we will unroll a cylinder with a geodesic helix on it and 
see that the helix goes to a line in the plane. 

The idea for unrolling a cylinder is to pretend that each side of the cylinder is a wheel rolling 
away from the middle. In this sense, the coordinates for the unrolling map are obtained by 
arguments similar to those used for the cycloid (see Exercise 1.1.13). The difference here is that 
when a point of the cylinder touches the ground, it stays there instead of rolling up the other side. 
This means we will need the ability to piece together the parts of the cylinder that are still moving 
and the parts that are stationary. Maple can do this using its “piecewise” command. We also must 
treat the left and right sides of the cylinder separately since they roll in different directions. Here 
is the Maple approach. We begin by only considering the parallel circles. Later we will bring 
the u-parameter back into the picture. According to the formula of Exercise 5.5.7, we have the 
following evolution of a parallel circle (to the right and left). Note that here we handle the second 
and third coordinates separately (by “fleft”, “gleft” and “fright”, “gright’). 


> fright:=(v,t)->piecewise(t<=v,t+sin(v-t) ,t>v,v); 
fright := (v, t) > piecewise(t < v, f+ sin(v — ft), v <?, v) 
> fleft:=(v,t)->piecewise(t<=v,-t-sin(v-t) ,t>v,-v); 
fleft := (v, t) > piecewise(t < v, —t — sin(v —t), v <t, —v) 
> gright:=(v,t)->piecewise(t<=v,1-cos(v-t) ,t>v,0); 
gright := (v, t) > piecewise(t < v, 1 —cos(v —t), v < t, 0) 
> gleft:=(v,t)->piecewise(t<=v,1-cos(v-t) ,t>v,0); 
gleft := (v, t) > piecewise(t < v, 1 — cos(v — 1), v < t, 0) 


We now put the left and right evolutions into separate mappings while bringing u (i.e., the 
distance along the cylinder) back into consideration. Note that Maple can define mappings by 
simply listing the inputs in parentheses and outputs as vectors (or lists). 


> Ytright:=(t,u,v)->[u,fright(v,t) ,gright(v,t)]; 
Ytright := (t, u, v) > [u, fright(v, ft), gright(v, t)] 
> Ytleft:=(t,u,v)->[u,fleft(v,t) ,gleft(v,t)]; 
Yileft := (t, u, v) > [u, fleft(v, t), gleft(v, 7] 
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Of course, we want to animate the unrolling, so we need fo create two animations — one left, one 
right — and then display them in sequence together. This means that we save the animations by 
naming them (remembering to use a colon rather than a semi-colon at the end in order to suppress 
plot structure information) and then use the command “display”, 


> right:=display(seq(plot3d(Ytright (i/20*Pi,u,v) ,u=-3..3, 
v=0. -Pi,grid=[8,20]) ,i=0..20) ,insequence=true) : 


> left:=display(seq(plot3d(Ytleft(i/20*Pi,u,v) ,u=-3..3, 
=0..Pi,grid=[8,20]) ,i=0..20) ,insequence=true) : 


> display({left,right},scaling=constrained, orientation 
=[18,76]); 


We know that a geodesic on the cylinder must be a ruling line, a parallel circle or a helix. The 
first two obviously unroll to straight lines. To see that helices also go to straight lines, we evolve 
a helix (e.g., (uw, sin(u), | — cos(«)) since the cylinder sits on the xy-plane) according to the 
unrolling mapping. See Figure 5.20. 
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Figure 5.20. Unrolling a geodesic on a cylinder 
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> helixevolveright:=(t,s)->[s,fright(s,t),gright(s,t)]: 


> helright:=display(seq(tubeplot (helixevolveright (i/20*Pi, 
s),s=0..Pi,radius=0.05,color=black) ,i=0..20) ,insequence=true) : 


> helixevolveleft:=(t,s)->[-s,fleft(s,t),gleft(s,t)]: 


> helleft:=display(seq(tubeplot (helixevolveleft (i/20*Pi,s), 
s=0..Pi,radius=0.05,color=black) ,i=0..20) , insequence=true) : 


> display({left,right ,helright ,helleft},scaling=constrained, 
orientation=[8,75]); 


We can also verify that the unrolling consists of isometries by calculation the metric at each stage. 


> cylR:=<ult+sin(v-t) |1-cos(v-t)>; 
cylR := [u, t+ sin(v —t), | — cos(v — 1)] 
> EFG(cylR); 
[1, 0, 1] 


Exercise 5.6.7. For Exercise 5.5.11, as the helicoid is isometrically bent into the catenoid, plot 
a geodesic with the same initial conditions fort = 2/2,t = 27/3,t = 5x/6,andt=7. 


5.6.4 Geodesics on the Unduloid 


We obtained a parametrization for unduloids in Theorem 3.7.7, so we can plot geodesics on 
unduloids as well (following [MO03b]). Recall that the metric coefficients for the parametrization 
of the unduloid are 


E=4a'? dr'(u), F=0, G=a?(1+e) dn*(u). 


These only depend on u (as they must since the unduloid is a surface of revolution), so the 
parametrization is u-Clairaut. Therefore, a Clairaut relation holds: 


VG sin(6) = a(1 + €) dn(w) sin(6) = c. 


Geodesics on the unduloid obey the Clairaut relation. Thus, their behavior can be predicted. 
Moreover, because unduloids are u-Clairaut, geodesics are characterized by the integral 


_ 4 2dn(uo) a du 
C+) Jus /dn2(u) — dn2(uo) 
dn 


(uo) / : du 
=+ ———— 
Ve Jug /sn2(uo) — sn2(u) 
Here, uo is the “initial” point of the geodesic and we have chosen @ = 2/2 in the Clairaut relation 
to get c = a(1 + €)dn(uo) which has been substituted in the general integral formula. Note that v 
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is continuous and monotonically increasing in u. This explains why our geodesics always proceed 
in a “forward” direction. 
The following steps determine closed geodesics on unduloids. 


¢ Write a Maple procedure with inputs € (to specify an unduloid) and up (to specify a starting 
point for a geodesic which starts parallel to a parallel circle of the unduloid) and output the 
u-value corresponding to when the geodesic achieves revolution variable v equal to z. 


© We look for those outputs u which are zero. Because we use numerical solutions of the 
geodesic equations to compute the endpoints of geodesics, we cannot hope to obtain u = 0. 
Rather, because v is continuous and monotonically increasing, we look for positive and 
negative outputs in order to guarantee a starting up with final value vu = 0 atv =z. 


¢ The process above determines a closed geodesic on the unduloid for the following reasons. 
First, in the parametrization of the unduloid, u = 0 corresponds to the “equatorial” geodesic 
which is the parallel circle at the top of the unduloid’s hump. The unduloid is symmetric with 
respect to this equator, so if we know that the ug geodesic travels 7 around the unduloid 
at u = 0, then symmetry says that it will travel another z until it reaches the point on 
the unduloid corresponding to —ug. Then it will bounce off the parallel circle at —ug and 
travel around by a total of 27 back to the original parallel circle. Again, symmetry says 
that the geodesic must close up. Note that the geodesic obeys the Clairaut relation and, by 
[dC76, section 4-7, p. 302] (see Exercise 5.2.18) a geodesic can never be asymptotic to a 
non-geodesic parallel circle. Thus, the geodesic must bounce back. 


Remark 5.6.8. We note here that it is not true in general that the endpoints of geodesics 
depend continuously on initial conditions. The fact that it is true here is due to the fact that 
the parametrization is u-Clairaut. 


Geodesics on the unduloid may be found by solving the geodesic equations numerically using 
“dsolve” and then putting the numerical solution values into the parametrization for the unduloid. 
This is the content of the following procedure (which simply specializes “plotgeo” to the case 
of the unduloid). The inputs include the ¢ of the parametrization, initial points and derivatives, a 
running time “T”, the number of points used to plot each geodesic “N”, a grid of the form [a, 5], 
the orientation of the plot given by “theta” and “phi” and the color of the geodesic. 


> undugeo:=proc(eps,u0,v0,Du0,Dv0,T,N, gr,theta,phi,col) 
local kk2,del,del0,ulim,desys,ul,vi,geo,plotX, equator; 
kk2:=evalf (2*sqrt (eps) /(1teps)); 
ulim:=fsolve(JacobiSN(u,kk2)=1,u) ; 

del:=subs (a=1, [-a*(1-eps) *(Elliptick(kk2)+ 

EllipticF (JacobiSN(u,kk2) ,kk2))- a*(iteps)*(EllipticE(kk2) 
+ EllipticE(JacobiSN(u,kk2) ,kk2)), 

a*(iteps) *JacobiDN(u,kk2)*cos(v) ,a*(1+eps) * 
JacobiDN(u,kk2)*sin(v)]); 

del0:=subs ({a=1,u=0}, [-ax(1-eps) *(EllipticK(kk2) + 
EllipticF(JacobiSN(u,kk2) ,kk2)) - a*(i+teps)*(EllipticE(kk2)+ 
EllipticE(JacobiSN (u,kk2) ,kk2)) ,a*(1+eps) *JacobiDN(u, kk2)* 
cos(v) ,a*(1+eps) *JacobiDN(u,kk2)*sin(v)]); 
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desys:=dsolve({geoeq(del) ,u(0)=u0, v(0)=v0,D(u) (0)=Du0, 
D(v) (0)=Dv0}, {u(t) ,v(t)},type=numeric, output= 
listprocedure) ; 

ul:=subs(desys,u(t)); vi:=subs(desys,v(t)); 
geo:=tubeplot (subs (u=’u1’ (t) ,v=’v1? (t) ,del) ,t=0..T, 
radius=0.02,color=col,numpoints=N): 

equator :=tubeplot (del0,v=0. .2*Pi,radius=0.015, color=black) ; 
plotX:=plot3d(subs({u=u(t) ,v=v(t)},del) ,u=-ulim..ulim, 
v=0..2*Pi,grid=[gr [1] ,gr{2]] ,shading=xyY) : 

display ({geo,plotX,equator},style=wireframe,scaling= 
constrained, orientation=[theta,phi]); 

end: 


Of course, we shall mostly be concerned with ¢ # 0. In this case, a geodesic (which we hope 
will be closed) must bounce back and forth between parallel circles. The following procedure 
determines the u-value when the geodesic has gone around the unduloid by z. If u = 0, then 
symmetry considerations provide a closed geodesic. Because a numerical solution cannot be 
trusted to be 100% accurate, we need to use something like monotonicity and continuity to 
guarantee a closed geodesic. Notice several things about the procedure “halfbouncepoint” below. 
First, it is a procedure which encourages experimentation; that is, searches for closed geodesics 
are carried out by trial and error. Secondly, note that we dictate that the geodesic starts parallel 
to a parallel circle (at uo) by taking “D(v)(0)=1” and “D(u)(0)=0”. Third, we use the “fsolve” 
command to solve for the time “ttt” when the geodesic goes 7 around the unduloid. We then 
plug this “ttt” into “uuu” to find the corresponding value of u — “uuu0” — and either output this 
value or an error message when “fsolve” cannot find a numerical solution within a specified time. 


> halfbouncepoint:=proc(u0,eps) 

local kk2,del,geosystem,uuu,vvv,ttt,uuu0; 
kk2:=2*sqrt (eps) /(1+eps) ; 

del :=subs (a=1, [-a*(1-eps) *(EllipticKk(kk2)+ 

EllipticF (JacobiSN (u,kk2) ,kk2)) 

-a*(1teps) *(EllipticE(kk2)+ EllipticE(JacobiSN(u,kk2) ,kk2)), 
a* (1+eps) *JacobiDN (u, kk2) *cos(v) ,a*(1+eps) * 
JacobiDN(u,kk2) *sin(v)]); 

geosystem: =dsolve({geoeq(del) ,u(0)=u0,v(0)=0,D(u) (0)=0, 
D(v) (0)=1}, {u(t) ,v(t)},type=numeric, output= 
listprocedure, range=0. .25) ; 

uuu:=subs(geosystem,u(t)); vvv:=subs(geosystem,v(t)); 
ttt:=timelimit (30,fsolve(vvv(t)=evalf (Pi) ,t)); 

if type(ttt,float) then 

uuu0:=eval (uuu (t) ,t=ttt) ; 

uuu0; 

else print(‘time error‘); 

end if; 

end: 


The following example with ¢ = 0.3 shows our method. The first application of “halfboun- 
cepoint” shows the existence of a closed geodesic. Namely, a switch from negative to positive 
and the Intermediate Value theorem guarantee a uo with output equal to zero — that is, a closed 
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geodesic.Further applications zoom in on the correct value for u0 and the closed geodesic is then 
plotted (see Figure 5.21). 


> for jj from -2 to 2 do 
print (0.99+j j*0.005,halfbouncepoint (0.99+j}j*0.005,0.3)); od; 


0.980, —0.00354601376090929326 
0.985, —0.00155537162876178532 
0.99, 0.000478 13034775 1062978 
0.995, 0.00255504657297548180 
1.000, 0.00467593536017002748 


> for jj from -2 to 2 do 

print (0.9888+j j*0.0001,halfbouncepoint (0.9888+ jj*0.0001, 
0.3)); 

od; 


0.9886, —0.0000956024228562558400 
0.9887, —0.0000547339092084 732186 
0.9888, —0.000013848294 1866547530 
0.9889, 0.0000270544615188724781 
0.9890, 0.0000679746262203287898 


> for jj from -2 to 2 do 
print (0. 9888338558+j j*0.0000000001 ,halfbouncepoint ( 
0. 9888338558+ j j*0.0000000001,0.3)); od; 


0.9888338556, —0.205198910164031134 10~° 
0.9888338557, —0.121012293501782109 107° 
0.9888338558, —0.368252030431837074 107 '° 
0.9888338559, 0.473615710452207628 107 !° 
0.9888338560, 0.131548331038996991 10° 


Now we can plot the approximation to the closed geodesic given by starting at ug = 
0.9888338558. The result is seen in Figure 5.21. 


> undugeo(0.3,0.9888338558,0,0,1,100,200, [20,20] ,-90,68, 
blue); 


Exercise 5.6.9. The method just used for the unduloid can be used for any u-Clairaut parame- 
trization displaying symmetry about a “hump”. In particular, a surface of revolution such as 
the Whirling Witch of Agnesi (see Exercise 5.2.15) is suitable for this approach. Find closed 
geodesics on the Whirling Witch by creating a “halfbouncepoint” procedure. 
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Figure 5.21. Two views of a closed unduloid geodesic for ¢ = 0.3 


5.6.5  Geodesics on Surfaces not in R? 


Finally, for surfaces not in R?, but which are plane regions with conformal metric, we can still 
write Maple procedures to calculate the geodesic equations. We again need the metric first. The 
inputs now are a parametrization X and a scaling function g. 


> EFGconf := proc(X,g) 

local Xu,Xv,E,F,G; 

Xu := <diff(X([1] ,u) ,diff(X[2],u) ,diff(X[3],u)>; 
Xv := <diff(X[1],v),diff(X[2],v),diff(X[3],v)>; 


E := DotProduct (Xu, Xu, conjugate=false)/g*2; 
F := DotProduct (Xu, Xv,conjugate=false)/g*2; 
G := DotProduct (Xv, Xv,conjugate=false)/g~2; 


simplify([E,F,G]); 
end: 


Now the geodesic equations for the conformal metric are given by the following procedure. 


> geoeqconf :=proc(X,g) 

local M,eqi,eq2; 

M:=EFGconf (X,g); 

eqi:=diff (u(t) ,t$2)+subs ({u=u(t) ,vev(t)},diff(M[1] ,u)/ 
(2*M[1]))*diff (u(t) ,t)“2+subs ({u=u(t) ,v=v(t) }, diff (M[1] ,v)/ 
(M[1])) *diff (u(t) ,t) *diff (v(t) ,t)-subs({u=u(t) ,v=v(t)}, 
diff (M([3] ,u)/(2*M [1] ))*diff (v(t) ,t)°2=0; 

eq2:=diff (v(t) ,t$2)-subs({u=u(t) ,v=v(t)}, diff (M[1] ,v)/ 
(24%M[3])) *diff (u(t) ,t)°2+subs ({u=u(t) ,v=v(t)}, diff (M[3] ,u)/ 
(M[3])) *diff (u(t) ,t) «diff (v(t) ,t)+subs({u=u(t) ,v=av(t)}, 
diff (M[3] ,v)/(2*M[3] )) *diff (v(t) ,t)°2=0; 

eqi,eq2; 

end: 
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Finally, we can write a procedure which plots geodesics on surfaces defined by a conformal 
metric. 


> plotgeoconf:=proc(X,g,ustart ,uend,vstart,vend,u0,v0,Du0, 
Dv0O,T,N,gr,theta, phi) 

local sys,desys,ul,vl,listp,geo,plotX; 

sys :=geoeqconf (X,g); 

desys :=dsolve({sys,u(0)=u0,v(0)=v0,D(u) (0)=Du0 ,D(v) (0)=Dv0}," 
{u(t),v(t)},type=numeric, output=listprocedure) ; 
ui:=subs(desys,u(t)); vi:=subs(desys,v(t)); 

geo;=tubeplot (convert (subs (u=’ul? (t) ,v=’v1’? (t),X), list), 
t=0..T,radius=0.025,color=black, thickness=2,numpoints=N) ; 
plotX:=plot3d(X,u=ustart. .uend,v=vstart..vend,grid=[gr[1], 
gr[2]],shading=XY, lightmodel=light3) : 

display ({geo,plotX},shading=XY, lightmodel=light2,scaling= 
constrained, orientation=[theta, phi] ); 

end: 


Let’s look first at the stereographic plane defined in Exercise 5.4.20. Recall that geodesics 
may be radial lines or circles (where the unit circle is a special case). See Figures 5.22 and 
5.23. 


> polarplane:=<u*cos(v) |u*sin(v) |0>; 

polarplane ;= [u cos(v), usin(v), 0] 
= plotgeoconf(polarplane, (1+u~2)/2,0,4,0,2*Pi,1,0,0,1,8,30, 
[10,30] ,0,0) ; 
> plotgeoconf(polarplane, (1+u72)/2,0,4,0,2*Pi,0.5,0,1,0, 
1.08,30, [10,30] ,0,0); 
> plotgeoconf (polarplane, (1+u~2)/2,0,4,0,2*Pi,2,0,1,1,8,50, 
[10,30] ,0,0) ; 
> plotgeoconf(polarplane, (1+u~2)/2,0,13,0,2*Pi,8,0,4,1,35, 
200, [10,30] ,0,0); 


Figure 5.22. Unit circle and radial line geodesics on stereographic plane 
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Zegna 


Son 


Figure 5.24. Geodesic on the stereographic sphere 


Exercise 5.6.10. Try to create geodesics on the stereographic sphere of Exercise 5.4.7 as follows 
(see Figure 5,24). 


> sphere:=<cos(u)*cos(v)|sin(u)*cos(v) |sin(v)>; 
sphere := [cos(u)cos(v), sin()cos(v), sin(v)] 


> plotgeoconf (sphere, 1-sin(v) ,0,2*Pi,-Pi/2,Pi/2,0,1,1,-2, 
15,650, [25,25] ,60,66); 


Exercise 5.6.11. Plot the geodesic on the Poincaré plane given in Figure 5.7. In other words, 
modify the plotgeo procedure to handle this situation. Hint: try 


> wuphalf:=<ul|v|0>; 
uphalf :=[u, v, O} 


> plotgeoconf(uphalf,v,-5,5,0.1,7,-4,0.2,1,20,2.5,800, 
[11,20] ,-90,-16); 
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Figure 5.25. Stereographically projected spherical curves 


5.6.6 Stereographic and Mercator Projections 


Maple can be used to “visualize” stereographic projection and the Mercator projection. The 
following procedure uses the formula for stereographic projection in Cartesian coordinates: 
St, y, 2) = (¢/(1 — 2), y/( — 2), 0). 


> Stereo:=proc(alpha,a,b,viewx1,viewx2,viewyl,viewy2) 

local sqleng,curv,sph,projcurv; 
sqleng:=VectorNorm (alpha, Euclidean, conjugate=false) “2; 
curv:=<alpha[1]/(1-alpha[3]) |alpha[2]/(1-alpha[3])>; 
projcurv:=plot([curv[1] ,curv[2] ,t=a..b] ,color=black, 
thickness=2 , numpoints=200) ; 

display ({projcurv},scaling=constrained,view=[viewx1..viewx2, 
viewy1..viewy2]); 

end: 


In Figure 5.25 we see some projected curves. The second is the projection of a latitude circle. 


> Stereo(<cos(t)*2,sin(t)*cos(t),sin(t)>,-Pi,Pi,-0.5,2, 
-1.6,1.6); 

> Stereo(<cos(t)*cos(Pi/4) ,sin(t)*cos(Pi/4) ,sin(Pi/4)>,0, 
2*Pi ,-3,3,-3,3); 


We can also define a procedure to do “inverse stereographic projection”. That is, we start with 
curves in the plane and pull them back to curves on the sphere (which map to the original curves 
under stereographic projection). See Figure 5.26. 


> InvStereo:=proc(alpha,a,b,theta, phi) 

local sqleng,curv,sph,plcurv,sphcurv; 
sqleng:=VectorNorm(alpha,Euclidean, conjugate=false) “2; 
curv: =<2*alpha[1]/(1+sqleng) ,2*alpha[2]/(1+sqleng) , 
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Figure 5.26. Inverse stereographically projected plane curves 


(sqleng-1)/(1+sqleng)>; 
sphcurv:=tubeplot (convert (curv, list) ,t=a..b,radius=0.015, 
color=black ,numpoints=200) ; 
sph:=plot3d([cos(u)*cos(v) ,sin(u) *cos(v) ,sin(v)] ,u=0.. 
2*Pi,v=-Pi/2..Pi/2): 

display ({sph,sphcurv},scaling=constrained,shading=zhue, 
orientation=[theta, phi] ,style=wireframe) ; 

end: 


> InvStereo(<t+1,2*t+3,0>,-8,8,60,52); 
> InvStereo(<cos(t)73,sin(t)*3,0>,-8,8,15,117); 


In Exercise 5.5.13 the Mercator projection was developed and it was stated that the projection 
is a conformal map. In particular, the Mercator projection preserves angles, a crucial feature for 
sailors in years past with regard to navigation. The most important use of this conformality was 
to plot a course by a straight line on a flat map making a fixed angle with vertical lines on the 
map and then navigate by sailing so that the course made the same angle with longitudes (i.e., 
North) on the Earth. We shall see such a loxodromic course below in Figure 5.28. 


> Merc:=proc(alpha,a,b,viewxl,viewx2,viewy1,viewy2) 

local curv,projcurv; 

curv: =<arctan(alpha[2] /alpha[1]) ,1/2*1n((1+alpha[3])/ 
(1-alpha[3]))>; 
projcurv:=plot( [curv [1] ,curv[2] ,t=a..b],color=black, 
thickness=2) ; 

display({projcurv},scaling=constrained, view=[viewx1..viewx2, 
viewyl..viewy2]); 

end: 


The following commands project latitudes and longitudes. See Figure 5.27. 


> Merc(<cos(t)*cos(Pi/4) ,sin(t)*cos(Pi/4) ,sin(Pi/4)>,0,2*Pi, 
-2,2,-2,2); 

> Merc(<cos(t)*cos(Pi/4) ,cos(t)*sin(Pi/4) ,sin(t)>,-Pi/2, 
Pi/2,-2,2,-2,2) ; 
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Figure 5.27. Mercator projection takes latitudes to horizontal lines and longitudes to vertical lines 


The next procedure takes plane curves and pulls them back to the sphere by the inverse of the 
Mercator projection. See Figures 5.28 and 5.29. 


> InvMerc:=proc(alpha,a,b) 

local sqleng,curv,sph,plcurv,sphcurv, invlogalpha; 
invlogalpha:=<exp(alpha[2])*cos(alpha[1]) ,exp(alpha[2] )* 
sin(alpha[1]) ,0>; 
sqleng:=VectorNorm(invlogalpha, Euclidean, conjugate=false) ~2; 
curv: =<2*invlogalpha[1]/(1+sqleng) ,2*invlogalpha[2] / 
(1+sqleng) , (sqleng-1)/(1+sqleng)>; 
sphcurv:=tubeplot (convert (curv, list) ,t=a..b,radius=0.015, 
color=black ,numpoints=200) ; 


Figure 5.28. Inverse Mercator projection Figure 5.29 Inverse Mercator projection takes a 
takes lines to loxodromes parabola to an interesting curve 
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sph: =plot3d([cos(u) *«cos(v) ,sin(u)*cos(v),sin(v)] ,u=0..2*Pi, 
v=-Pi/2..Pi/2): 

display ({sph,sphcurv},scaling=constrained, shading=zhue, 
style=wireframe) 

end: 


We can now display a loxodrome (Figure 5.28) by pulling back a plane line under inverse 
Mercator projection. This is the path a ship would follow to keep a constant angle with “North”. 


> InvMerc(<5*t+3,t+1,0>,-5,5); 
Here is another interesting curve on the sphere obtained by pulling back a parabola. 
> InvMerc(<t*2,t,0>,-5,5); 


Exercise 5.6.12. Carry out the following Maple commands and make sense of the resulting 
pictures. 


> InvMerc(<3,t,0>,-5,5); 

> InvMerc(<t,0.5,0>,-5,5); 

> InvMerc(<t,t72,0>,-5,5); 

> InvMerc(<t+1i,sin(t) ,0>,-5,5); 

> InvMerc(<sin(t),cos(t),0>,-5,5); 

> InvMerc(<cos(t)73,sin(t)~3,0>,-5,5); 


The bane of map makers has always been the fact that there is no way to both preserve 
angles and areas at the same time. This is essentially a consequence of Gauss’s Theorem 
Egregium (Theorem 3.4.1). Below, we calculate the area element for the Mercator parametri- 
. Zation. 


> mercproj:=<ulln(tan(v/2+Pi/4)) |0>; 
mercproj := eB In (tan (5 + *)) ; 0| 


The area element for a parametrized surface is given by EG — F?. Let’s compare the area 
elements for the Mercator parametrization of the plane (i.e., a map) and for the sphere. 


> mercmet:=EFG(mercproj); 


mercmet := | 1, 0, 
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The area element is then 


> simplify (sqrt (mercmet [1] *mercmet [3] -mercmet [2]*2), 
symbolic); 


vo W\2 


2 v ) 
t aieae cea 
an (5 + 4 
> simplify(1/(2*expand(sin(1/2*v+1/4*Pi) *cos(1/2*v+1/4*Pi)), 


symbolic)); 
1 


v\2 
2 cos (5) -1 
This actually simplifies to 1/ cos(v). The sphere’s area element is found as follows. 


> sphmet:=EFG(sphere) ; 
sphmet := [cos(v)”, 0, 1] 


This clearly gives an area element cos(v) for the sphere. If we define the distortion of the map by 


Area element of Mercator i” 1/ cos(v) a l 


Areaelement of sphere —  cos(v) —_—cos*(v)’ 


then we see that areas are distorted under the Mercator projection. Note that the amount of 
distortion depends on the latitude v. Namely, the distortion increases as v approaches +7 /2. This 
means that regions at high latitudes appear larger than they really are on a Mercator map. Some 
people claim that this was one reason why Europeans preferred the Mercator projection as they 
began to explore a very large world. For more on the geometry of maps, see [Fee02]. For a Maple 
approach to mapping the world, see [TBO01]. 


5.7 An Industrial Application 


In this section, we present an application of differential geometry to an industrial problem. This 
application may be found in [BM95] and the reader will find some details there that we omit 
here. Because we cannot possibly include all details of calculations, this section 1s somewhat of 
an extended exercise. 

A common problem in industry is that of creating a smoothly run procedure for packaging 
production items. Typically, packaging material (e.g., plastic wrap) is unwound from a horizontal 
roll and fed onto a curved surface (called a “shoulder”) which serves to guide the material into 
a vertical cylinder where it molds itself against the cylindrical wall and is sealed along the front 
edge and bottom to form a “bag” of some type. Production items are then dropped into the bag, 
the top is sealed and the bag is dropped from the cylinder. The problem with such a process is 
that, unless the shoulder has the proper shape, crinkling of the packaging material may occur 
and this may lead to material jams stopping the process. So the geometric problem that arises 
is to determine a shoulder shape which makes the process ‘“‘smooth”. Because the shoulder is a 
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transition surface from a plane (i.e., the unrolled plastic) to a cylinder obtained without stretching 
or tearing, it is intuitively clear that it should be isometric to the plane (and to the cylinder of 
course). This means that the shoulder should be a developable surface. In order to make sense 
of some of the discussion that follows, the reader may want to jump ahead to Figure 5.30 for a 
picture of a possible shoulder. 

Practically speaking, developability is achieved as follows. We take a curve @(v) in the plane 
which is concave and has its single maximum on a vertical axis about which a@(v) is left-right 
symmetric. We want @ to have width 27 R at some specified height above the horizontal axis. 
Now roll the plane vertically around a cylinder of radius R. The points of @ at width 27 R come 
together to give a closed curve a on the vertical cylinder. The part of the plane below height h 
becomes the cylinder while the part of the plane above @ is folded back to become the shoulder. 
The curve @ is called the bending curve. Finally, to make the transition from the plane to the 
cylinder as smooth as possible, we may require a part of the shoulder to be planar as well. The 
questions are: what should a be and how should the folding be done? This is where differential 
geometry comes in. 

Let’s write a(v) = (v, z(v)), -—t@R <u <aR, for some function z(v) of the sort men- 
tioned above (e.g., an even function, for instance). We may then parametrize a by a(v) = 
(R cos(v/R), R sin(v/R), z(v)) using the standard parametrization for a vertical cylinder of ra- 
dius R. We can relate & and a as follows. First, if@ is parametrized by arclength s, then we have 
s = f |@(v)dv with 


ds dv l 

—=VJl+z*? or —=——. 

ds /1 + Zee 

The curvature formula from Theorem 1.4.5 then gives formulas for the curvatures of @ and a 
respectively. 


za Luv A ae (Reza? + ag + 1)! 
(1 + zy?)3/? R(1 4+ z,2)3/7 
Note that |@’ x @”’|? = —z,, since we assume @v) is concave; that is, z,, < 0. Comparing 
formulas, we see that 
] 
ear + 


Rl +22) 
Therefore, € < «. Similarly, the torsion t of a can be computed to be 
Ss: ears s 
R(R22yy? + Zp? + 1) 
With this preliminary information in hand, we can start to look at the developable surfaces 
associated to @ and a@. These are ruled surfaces of the form 


X(s,u) = a(s) + ud(s), x(s, u) = a(s) + ud(s), 


where s is chosen to be the arclength of @ (and so also the arclength of a since we require an 
isometry). Let’s focus on the vector-valued functions d(s) and d(s) which give the directions 
for the ruling lines of the surfaces x and x. We will describe these vectors in terms of the basis 
{T(s), N(s)} in the plane and {T(s), N(s), B(s)} in 3-space. Furthermore, the bending isometry 
of the plane onto the shoulder should carry T(s) to T(s) for all s and must preserve all angles. 
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Therefore, N(s) must be taken to a unit vector that is perpendicular to T(s): hence, this vector 
may be written as cos(¢(s))N(s) + sin(¢(s))B(s). Therefore, we can write 


d(s) = cos(8(s))T(s) — sin(6(s))N(s) 
d(s) = cos(5(s))T(s) — sin(d(s))(cos(¢(s))N(s) + sin(P(s)))B(s) 


where 5(s) does not change because isometries preserve angles. (The minus sign is chosen in d(s) 
because d lies outside of the quadrant defined by T and N.) Note that 5(s) is simply the angle 
that a ruling line makes with the curve a. Now, using the standard trick that d- d = 1 implies 
d-d=d- d, = 0, we easily compute the metric coefficients (with E = x, - x, for instance) 


E=(f+ud,)-(T +ud,), F=T-d=cos(6), G=d-d=1, 
E=(T+ud,)-(T +ud,), F=T-d=cos(6), G=d-d=1. 


Note that F = F and G = G, so obtaining an isometry from the plane to the shoulder simply 
requires E = E to hold. Expanding the dot products and comparing coefficients of powers of u 
yields the conditions 


T-d,=T-d, and d,-d,=d,-d,. 
We can now use the Frenet formulas to express d, and d,: 


d, = (—sin(8)5, + sin(5)K)T + (cos()K — cos(6)5,)N 


d, = (— sin(6)é, + sin(5) cos(@)« )T 
+ (cos(6)x — cos(5)6, cos(@) + sin(d) sin(d)d, + sin(6) sin(@)t)N 
+ (—cos(6)6, sin(@) — sin(d) cos(@)t — sin(6) cos(¢)¢,;)B. 


Here we write ¢, = d¢/ds and 6, = d5/ds. Note that torsion t arises in the second equation 
because d lives in 3-space. 


Exercise 5.7.1. Verify the expressions for d and d given above by using the Frenet formulas. 


We can now begin to consider the conditions for isometry. We compute: 
T .d, = —sin(6)5, + sin(S)K 
d,-d, = 5.’ +k — 25K 
T -d, = —sin(5)6, + sin(d) cos(¢)x. 
The quantity d, -d, has a rather messy form, so we will wait to give it below when we discuss 


the condition d, -d, = d, - d,. The condition T -d, = T - d, gives 


K —RZyy 
5) = = (RP te DIE 
This determines ¢ up to two possibilities since, for 0 < @ < 27 say, there are two angles, 
g; and @2 with cos(¢;) = K/x, sin(d,) > 0 and cos(¢2) = K/k, sin(@2) < 0. In other words, 
0< 1 < m/2 and = —¢. 
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After a great deal of algebraic manipulation, the condition d, - d, = d, - d, reduces to 


K* — 28, =(sin?(5) cos*(p) + cos?(6))k? — 2 cos(¢)« 6, + (2 cos(S) sin(S) sin(¢))« ds 
+ (2 cos(8) sin(S) sin(®))«t + sin?(6)(¢, + T)*. 


Exercise 5.7.2. Verify this calculation. 


Now, we know K = x cos(#), so we have K* — 25,K = x? cos?(#) — 25, cos(#). Hence, we 
can replace the left side of the equation above and simplify to get 
0 = (sin?(5) cos*() — cos*(#) + cos?(5))k? + (2 cos(5) sin(S) sin())«(@, + T) 
+ sin?(5)(@5 + 1)? 
= (cos*(8) sin’*(¢))k? + (2. cos(5) sin(8) sin(d))K(@, + T) + sin?(S)\(b, +t). 


This is a quadratic equation in ¢, + tT, so we may use the quadratic formula to obtain 


_ cos(6) 
sin(d) 


bds+tT= sin(?)x. 


This may also be written as 


tan(5) = —x sin(p)/(¢, + T) 
or 
gos = —t —x sin(¢) cos(d)/ sin(6). 


Now let’s compute @, from cos(¢) = —RZvy/(R?Zyy2 + v2 + 1)!/?. From sin?(#) + cos?(¢) = 
1, we first get 


el + z,7)!/2 
Gute: (eae 
sing) = 2y1/2 
14+ zy 
sae een for = ¢2 


(R2Zpy? + 2p? + 1)? 


Now differentiate cos(¢) with respect to s (using the chain rule and recalling du/ds = 
(1 + z,2)7!/*) and solve for ¢,. 


. dcos(¢) 
— sin(g)py = “SS 
= =RZyy(R?Zy7 + Zur +1)+ RZ pCR 2uytave + ZyZuv) 
a + Zy2)!/2(R2Zyy2 + Zy2 + 1)3/2 
R21 a5 Zi°) = Racor 
od; = 


( =e SCR Zaee + Zit + 1) 


266 5. Geodesics, Metrics and lsometries 


where + refers to @; and 2 respectively. If we substitute the expression for @, into that for tan(64), 
we obtain 

Ry? + Zy? +1 
(2R2Zyuy + rane! + Zp") = R222 yy2 


tan(d,) = 


1 
tan(d2) = —. 
z 


v 


Here, as usual, 5; corresponds to angle ¢; and 42 corresponds to ¢2. Furthermore, putting 
os = —t — xk sin(@) cos(d)/ sin(é) into the expression for d, gives a much nicer form: 


d, = (kK — 6;)(sin(d)T + cos(d) cos(¢@)N + cos(d) sin(¢)B). 
Exercise 5.7.3. Verify the calculations above. 


Now, for the surface x(u,s), we can calculate the unit normal U using x, =d and x, = 
T +ud,: 


X, X Xs; = —sin(@)(sin(d) + u(K — 6,))N + cos(#)(sin(5) + u(k — 6;))B 
|x, x x,| = sin(d) + u(k — 6,;) 
U = —sin(¢)N + cos(¢)B. 


Since eventually we want to put everything into coordinates for explicit calculation, we 
need coordinate representations for T, N and B. We use the explicit coordinate form a = 
(R cos(v/R), Rsin(v/R), z(v)). 


_ da 
~ dv 


= eee (- sin(=) ; cos( =) ‘ Zu) 


5 ae (- sin) cos(—) zu) 
(1+ 2y2)/2(R2zyy2 + 2,2 + 1)!/2 R)? Rye 
CeeP oe (-om( 3). -80(3) 
abr ara He (og) sing) Rew) 

Ren? Fae ced sin( 3 z 
B=TXxN 


~ (R22 y,2 — +2 (Ren cos(=) +2y sin(=) cee cos( =) ae oe sin(2) ') 


Using these expressions as well as the ones for cos(@) and sin(#), we can find a coordinate 
representation for U. Of course, there are two cases: @ = ¢; and ¢ = @>. Let ¢ = g. Then 
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sin(p) = —(1 + 2,7)'/?/(R? zy? + Zy? + 1)!” and it is easy to compute 


ts (-en(Z).-sin(2).0) 


This is the inward pointing normal of the cylinder! When ¢@ = ¢;, then sin(@) = (1 + 
Zy°)!/2/(R2Zyy? + Zy2 + 1)!/? and the third coordinate of U; is —Rzvy/(R?Zw? + 2? + 1) > 0. 
This is the unit normal of the shoulder. 
The angle @ between the outward normals U, and —U, is found by taking the dot product. 
cos(#) = U; -(—-U2) 

= (—sin(¢1)N + cos(¢1)B) - (sin(¢2)N — cos(¢2)B) 

= — sin(¢1) sin(¢2) — cos(g;) cos(¢2) 

= — cos(¢; — ¢2) 

= — cos(2¢)) since $2 = —¢, 

= —cos($1) + sin*(¢;) 

= | — 2cos*(¢1) 


z\2 
Jey (=) | 
K 
In coordinates, we have 
—R22,,2 + 25° + l 
R2zy,7 +22 +1 © 
We can also write the direction vector d in terms of z using the z-expressions for cos(@), sin(@), 
T, N and B. Of course, any vector in the d-direction will serve equally well, so we can multiply d 
by anything we want. Let d = (X(1 + z,7)!/*/ sin(5)) d, where we write X = R*z,,2 + 2,7 + 1. 
Then we have 


d= es ( - sin(=) ; cos(=) ‘ i) + [(Ren?2 -d+ z")zu)sin(=) 


Vv ) 
—2R2yy(1 + z")eos(=) »—(R22yy22y — (1 + 2,7) zy) cos( =) 


cos(@) = 


SOR le z")sin(=) R22 — (+ 2}. 


Now we can substitute 
cos(5) ] OR 23:8 + zy) + Zi) = R?ZZv7 


sin(S)  tan(6) xX 


and simplify to get 
eet 2 2) 2 (U\ | 2 bs 
d =((2R22wy + 21 + 22) —( + zy )z») sin( = ) 2Rew(1 + 20°) cos(=), 
_ 2 2y! 2 ce 2» nf YU 
((2R2Zyyy + Zo) + 22) — (1 t 2p Jz») eos( =) 2Rew(1 + z0")sin(=), 


= (2R? ZyZv0v + zy) + zie) + erg + a) -U+ gery): 
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If we now divide by (1 + zy”), we obtain a simple direction vector. (In fact, in the Maple procedure 
below, we use the computing power of Maple to get a unit vector in this direction in order to 
control the length of ruling lines better.) 


a fv v v _ fv 
d= (2R%20 sin(=) — 2RZyy cos(=) IR ia cos( =) —2RZyy sin(=) P 
i QR zizew = ae + R? Zp" = 1). (5.7.1) 


We now have many of the components we will need to create a suitable shoulder. The last 
piece of the puzzle will fall into piace when we consider how to include a planar triangle into 
the shoulder to facilitate a smooth transition from an unrolled planar sheet of packaging material 
to a “cylindrical” bag. We situate the planar triangle so that a vertex is at v = 0 with z(v) an 
even function (i.e., z(—v) = z(v)) to ensure symmetry about the vertical axis. We also assume 
the maximum of z is at v = 0 (Z.e., z,(0) = 0). 

Recall that 5(s) is the angle between the ruling lines of the shoulder and the (tangent of the) 
curve a. Because the triangle fits into the top of the shoulder, the angle 5(s) is discontinuous 
at s = 0 (i.e., v = 0). This is the case because the ruling lines move along the curve a and 
at s = 0 there are two ruling line choices: the left and right edges of the triangle emanating 
from the vertex at v = 0. We denote the left limit angle by 5~ and the right by 5*. By sym- 
metry, 5” = 7 — 5*. Of course, the angle of the triangle between these ruling lines is given 
by 


B=S —&t =n -28t. 
To find £, take 


d -d' =d~-d* 
= cos(S~) cos(5*) + sin(S~) sin(*) 
= cos(é~ — 6+) 


= cos(x — 25*). 


Of course, it is tan(é,(s)) which determines 5)(s) and tan(5;(s)) depends on Zyy,. Therefore, 
to make 5,(s) discontinuous at s = 0, we take z,,, discontinuous at v = 0. Let zy, )(07) = 
—Zyy(07) < 0. 

Using z,(0) = 0 and our formula for tan(5), we obtain 


BV or 2g 
tan( £) = tan(> —6 ) 
_ 1 
~ tan(5*) 
_ 2Rzyv(0*) 
~ R2z,,2(0) + 1 
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where 2,y,(0+) < 0. Thus B is determined. The angle 6) between the planar triangle and the 
tangent plane to the cylinder at (R, 0, z(0)) is determined by 


—R?z,,7(0) + 1 


cos(6o) = “Ra 0) + 1 


since z,(0) = 0. 

All of this information can now be assembled into an algorithm for finding an appropriate 
bending curve a meeting certain geometric conditions. For example, we can specify the height 
h of the shoulder, the radius R of the cylinder, the angle 4) between the shoulder and cylinder at 
the highest point of a, the angle 6, between the shoulder and cylinder at the lowest point of a, 
and the opening angle § of the planar triangle with vertex at the highest point of a. 

First, to non-dimensionalize the problem, take z = Rf(€),& =v/R, a <& < x, with 
3 


sing) —§ + =|, 


6 


2 
f(E) = co + en? +0318)? +-ca( coste) - 1+ =) cs 


f"(&) < 0 (for concavity) and f(z) = 0. The form of f is suggested by an analysis of conical 
shoulders which may be found in [BM95], but which we omit. (We here use the notation of 
[BM95] to aid the reader who is interested in examining the complete paper.) The fact is that the 
exact definition of such an f is really a matter of trial and error — the first method of science. In 
order to determine an appropriate shoulder, we must find coefficients cg, c2, c3, C4, Cs Satisfying 
the geometric requirements noted above. 

Now, assigning shoulder height z(0) = h gives f(0) = 4/R, so we have cp = h/R. We also 
have 


—R?z,,7(0) + 1 
R?z,,2(0) + 1 
_ -’" OY +1 
(FOR +1 
43 +1 

4c3 +1 


cos(4%) = 


and we can solve for c2 (taking the negative sign to have f’(0) = 2c2 < 0) to get 


1 (1 —cos()\'7 1 tan( 
Cc. = —= | —————— =—-> —]}. 
> "2 \1 + cos) 2 1 
Now, f(€) has a discontinuous third derivative at & = 0: f’”(0*) = +6c3. Therefore, we get 
a planar triangle of opening angle 6 with 


2RZyy(0T) aS 2f"(0") 12c3 


B\ — — 2Rezwo(0*) FOF) | 
in(5) = R220) +1 (f"(O)P +1 48 41 


Thus, prescribing 6 determines c3. The requirement f(7) = 0 gives 


2 3 
co ten? + e307? +¢4 (-2 + =) + ¢5 (-x + =) = 0. (5.7.2) 
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We want 6 = 6; at the lowest point (i.e., & = 2) of the bending curve, so we have 


—f"@)y +(F'@yY +1 
(FDP + (FP +1” 


cos(9;) = 
This is equivalent to 


6 
(f'n) — (Ff oy + Ita? (2) = 0, 


and, using the formula for f(é), we calculate this to be the equation 


(2co + 6c37 + 2c4 + 205)" 


2 2 
@ 
= ((201 + 3307 + cam +5 (-2 + =)) + ) tan?) =0. (5.7.3) 


The equations (5.7.2) and (5.7.3), together with the previous determination of cg, c2, c3, determine 
c4 and cs. Therefore, the geometric requirements lead to a specific f(€) and, consequently, a 
specific bending curve. Of course, we wish to parametrize the shoulder, so we need a direction 
vector as well. For this, we simply substitute the following into (5.7.1): 


gE = > z(v) = Rf (&), Zp = ratae Zw = =f"). Fike os f"). 


We obtain a direction vector 


d= (s ™() sin(€) — f" cos(&), — f'"(E) cos(&) — f'"(&) sin(é), 


~ f@F"e)- — + 


Ey 47 e) 1 

5 5 ;). (5.7.4) 
Example 5.7.4 (A Shoulder Curve). 
Let h/R = 6 = co, % = 90°, 6; = 10° and B = 90°. We get c, = —} tan(6)/2) = —} tan(45°) 
= -}. Similarly, tan(45°) = (—12c3)/2, so c3 = —1/6. We solve the equations (5.7.2) and 
(5.7.3) with 6; = 10° using Maple’s “fsolve” command (see the Maple procedure below). We 
obtain cq + .986 and cs ~ .596, so that we have 
#3 


2 
5 sing) —§ + — 


f(—&)=6- 3 - ier + .986 (coste) Sie =) + .596 


We can write a Maple procedure with inputs R, h, 6), 0;, 8B, as well as a grid size and an 
orientation, to plot the resulting shoulder. Note that we use Maple’s “unapply” command to 
create a function simply to demonstrate an alternative to the arrow construction. We also use the 
“assign” command to assign the values of cq and cs determined by “fsolve” so that they may be 
used in later parts of the procedure. Finally, the direction vector “direction” is made into a unit 
vector by multiplying by the inverse of its length. 


> with(plots) :with(LinearAlgebra) : 

> shoulder:=proc(R,h,highangle, lowangle,gapangle,gr,ori) 
local c_0,c_2,c_3,c_4,c_5,f,fp,fpp,fppp,direction,sh, 
bendcurveplot,should,cyl,theta_0,theta_i,beta; 
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theta_0:=Pi/180*highangle; 

theta_1:=Pi/180*lowangle; 

beta:=Pi/180*gapangle; 

c_0:=h/R; 

c_2:=-0.5*evalf(tan(theta_0/2)); 
c_3:=-(4*c_2°2+1) *evalf (tan (beta/2)/12); 
f:=unapply(c_0+c_2*xi"2+c_3*abs (xi) “3+c_4*(cos(xi)- 
1+xi72/2)+c_5*abs(sin(xi)-xi+xi73/6) ,xi); 
fp:=unapply (diff (f(xi) ,xi),xi); 
fpp:=unapply (diff (f (xi) ,xi$2) ,xi); 
fppp:=unapply (diff (f£(xi) ,xi$3) ,xi); 

assign (fsolve({f(Pi)=0,fpp(Pi) “2-(fp(Pi) ~2+1)* 
tan(theta_1/2)72=0},{c_4,c_5})); 

print (‘coeffs are‘ ,c_0,c_2,c_3,c_4,c_5); 
direction:=ScalarMultiply (<-fpp (xi) *cos(xi)+fppp (xi) * 
sin(xi) |-fpp(xi)*sin(xi)-fppp (xi) *cos (xi) |-fp(xi)*fppp(xi)+ 
(fpp (xi) *2-fp(xi)*2-1)/2>, 1/(sqrt((-fpp (xi) *cos(xi)+ 
fppp(xi)*sin(xi))°2 +(-fpp(xi)*sin(xi)-fppp(xi)*cos(xi))72+ 
(-fp (xi) *fppp (xi) +(fpp (xi) 72-fp(xi)~2-1)/2)72))); 
sh:=<R*cos(xi) |R*sin(xi) |R*f(xi)>+ 

ScalarMultiply (direction,u); 
bendcurveplot:=tubeplot ([R*cos(v) ,R*sin(v) ,R*f(v)], 
v=-Pi..Pi,radius=0.03,color=black): 
should:=plot3d(sh,xi=-Pi..Pi,u=0..3*R,grid=gr): 
cyl:=plot3d( [R*cos(v) ,R*sin(v) ,u] ,v=-Pi..Pi,u=-h/2. .R*f(v), 
grid=[30,10]): 

display ({bendcurveplot,cyl,should},style=patch, orientation 
=ori,scaling=constrained) ; 

end: 


We can now perform the procedure, starting with the values used in Example 5.7.4. Note that 
we output the coefficients in the order co, c2, C3, Ca, Cs. We show two views in Figure 5.30. 
The first view clearly shows the planar triangle while the second view shows the angles 4 
and 0; : 


> shoulder(1,6,90,10,90, [30,10], [161,63]); 
coeffs are, 6, —0.5, —0.1666666667, 0.9861424355, 0.5964019490 


> shoulder(1,6,90,10,90, [30,10] , [88,87]); 
coeffs are, 6, —0.5, —0.1666666667, 0.9861424355, 0.5964019490 


Let’s take another case with R = 2, 0) = 110° and 6; = 0°. The result is shown in Figure 5.31. 


> shoulder(2,13,110,0,60, [30,10], [161,63]); 
13 
coeffs are, 7 —0.7140740040, —0.1462431472, 1.448802757, 0.4097160318 


> shoulder(2,13,110,0,60, [30,10] , [86,88]); 
13 
coeffs are, sy oo OT IAO A000, 0.146243 1472, 1.448802757, 0.4097160318 
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Figure 5.31. A shoulder with planar triangle and 4) = 110° 
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This section has shown us several things. We have seen that differential geometry is not just 
the mathematics of abstract physics, but the mathematics of everyday industry as well. But this 
fact also leads us in the direction of practicality and we have seen just how complicated everyday 
industry can be. Finally, all of the assumptions, calculations and approximations above would be 
meaningless without the ability to see the end result — and here is where computers and software 
such as Maple have changed the way we do mathematics. 


Holonomy and the Gauss-Bonnet 
Theorem 


6.1 Introduction 


Euclid’s parallel postulate states that through any point off a given line there is a unique parallel 
line. The following exercise reminds us of how such a postulate may be used to derive essential 
geometric properties. 


Exercise 6.1.1. Use the parallel postulate to prove that the sum of the angles of a triangle is zr. 
Hint: draw a line through a vertex of the triangle parallel to the opposite side. 


At one time it was thought that the parallel postulate could be derived from other natural 
axioms of Euclidean geometry. It was only through the work of Lobachevsky, Gauss and Bolyai 
in the 1800s that mathematicians realized just how misguided this attempt had been. Perhaps it 
shouldn’t be such a surprise that Gauss would see the necessity of actually postulating the parallel 
postulate. His investigations into the geometry of surfaces led not only to his Theorem Egregium 
concerning Gauss curvature (which, in fact, had previously been defined by O. Rodrigues), but 
to an understanding of various fundamental geometric quantities on surfaces. In particular, the 
“lines” of a surface may be considered to be its geodesics and Gauss could see that geodesics did 
not always follow the dictates of the parallel postulate. 


Exercise 6.1.2. Discuss the parallel postulate’s validity for the geometries of the sphere and the 
hyperbolic plane where lines are geodesics. 


As the previous exercise demonstrates, the parallel postulate may fail in two ways. Because 
we have seen above that the parallel postulate may be used to calculate the sum of angles in 
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x 


u 


Figure 6.1. An infinitesimal piece of area 


a triangle, we might expect that its failure would lead to variability among such calculations. 
Indeed, we find that there are three possibilities for the sum of the angles of a triangle. 


(1) Euclidean. In the Euclidean plane , the angles of a triangle obey the equality ey o; =T1. 


(2) Hyperbolic (Gauss—Lobachevsky-Bolyai). In the hyperbolic plane, the angles of a trian- 
gle obey the equality Bae o; < 7. 


(3) Elliptic (Riemann). In the sphere, the angles of a triangle obey the equality Yo oj > 1. 


In this chapter, we will see how Gauss curvature affects even the most basic quantities of geometry 
such as angle sums of triangles. Further, we shall see that Gauss curvature makes its presence 
felt even in the most basic of geometric notions, the notion of when two vectors are parallel. 

Before we begin, we must recall how to integrate on a surface. Readers of Chapter 4 may 
skip the following redundant discussion. First, consider how we compute the surface area of 
a surface. For example, the surface area of an R-sphere in 47 R? and the intuitive reasoning 
behind this is as follows. Let M be a surface and consider a patch x. To approximate the area 
of the patch we might find the area of each parallelogram as shown in Figure 6.1 and then sum 
them up. 

Of course, as usual in mathematics, we would then take a limit to make our approximations 
approach the true area. The limit then gives a continuous sum, the integral. It is important to 
note that we are defining the integral here. For sufficiently complicated “surfaces” outside our 
restricted smooth definitions, this notion of integration may fail. The area of such a parallelogram 
may be found by Exercise 1.3.8. Namely, Area = |x, x x,|. Hence, we have 


Vv) uy 
Area of patch x = / | |X, X X,|dudv. 
vo ug 


Example 6.1.3 (Surface Area of the R-Sphere). 


Recall that |x, x x,| = R2cosvand0 <u < 27, — 5 <u < 5. The area of the R-sphere is then 
given by 
dn pS 2n 15 
Area = [ / R* cosudv du -|/ R? sinvdu 
09 ¥-5 Q 1-5 
2x 2n 
= / 2R? du=| 2R*u = 4nR?. 
0 0 
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Note that, by Lagrange’s Identity (Exercise 1.3.5), |x, x x,.| = VEG — F?. 
Exercises 6.1.4. Show that the surface area of the torus is 42?r R. 


Exercises 6.1.5. In calculus, two of the usual integrals used to calculate surface area are: 


b 
(1) Surfaces of revolution: SA = / Qn f(x) V1 + fx) dx, 


a 


b pd 
(2) Graphs of functions z = f(x, y): SA= i] i] ghee, to pardy: 
a Cc 


Show that these formulas may be derived from the definition of surface area given above. 


Now, how do we integrate a function over a surface? A function simply gives different weights 
to the points in a region and then “adds up” the weighted area. Hence, we simply multiply the 
function by |x, x x,| and integrate. 


Definition 6.1.6. The integral of f over M is defined by 


fort ff pi xxiaudy 
M x 


= [ [ VEG= Pau dv. 


We shall be interested in the integral of a specific function — the Gaussian curvature K. The 
integral [ u XK is called the total Gauss curvature of M (or of a patch x). 


Example 6.1.7. (Total Gauss Curvature) 
We have the following examples of the total Gauss curvature. 


(1) The R-sphere. The Gauss curvature is the constant K = 1/R?, so 


1 | 
[k= ga] [im xian = 35 Area 


1 2 
R2 
(2) The Pseudosphere: The Gauss curvature is constant here as well, K = —1/c?. Hence, 
! ] 
/ K =-—.- Area = —— -2nc* = —2z. 
M c ct 


Exercise* 6.1.8. Find the surface area of the pseudosphere (or bugle surface). 
Exercises 6.1.9. Find the total Gaussian curvature of the torus and of the catenoid. 


6.2 The Covariant Derivative Revisited 


In our initial approach to the geometry of surfaces in R* we used the covariant derivative to 
define the shape operator, and the various basic curvature invariants followed from this. Later, 
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we defined Gauss curvature for surfaces not in R? by taking a particular formula for the curvature 
and using it as a definition. In this way we circumvented defining a covariant derivative for such 
abstract surfaces. In this section, we will see that defining and understanding a notion of covariant 
derivative in a more abstract context is worthwhile for reasons beyond the definition of curvature 
alone. 

To begin, let’s take x to be a patch with (as usual) F = 0 and suppose there is a closed curve 
(assumed to be unit speed or, at least, constant speed) a(/) = x(a(t), b(1)). Let EF; = x, / VE and 
£5 =x, / JSG and note that €; - €; = | and €; - €) = | (with €; - E; = 0). Thus, &; and €), which 
we say are fields of vectors along the curve, form an orthonormal basis for the tangent space at 
any point x(u, v). For this reason the basis of vector fields (€&;, &2) is called a moving frame 
on the patch. (More generally, such a basis may be found even in the case of a non-orthogonal 
patch.) Further, €;, & and U form an orthonormal basis for R?. Now, just from this information 
and what we know about covariant derivatives, we will analyze how the vector fields &; and &> 
change along a. 

First, we will consider the most concrete case of a surface in 3-space. This will allow us to see 
how the covariant derivatives of 3-space and the surface itself relate to each other. Indeed, the 
latter is defined in terms of the former. Then, once we understand what the covariant derivative 
tells us in this situation, we can eliminate 3-space from the discussion and abstract the qualities 
of the covariant derivative to other surfaces. 

We will use the notation V"" for the covariant derivative (or Jacobian matrix) in R! and V 
for either the projection of V®' to the tangent plane of the surface or simply for the covariant 
derivative of M considered without regard to 3-space. Let Z = (z', z?, 2°) be a vector field in 
> defined by functions z': R? — R. Recall that the IR?-covariant derivative vez is simply 
a coordinate-wise version of the directional derivative (see Section 2.2). Then, for M C R?, 
the projection of ve’ Z onto the tangent plane 7;,(M) is obtained by subtracting off its normal 
component, 


WZ = VE Z-(VEZ-U)U. 


This is then the definition of the covariant derivative intrinsic to the surface itself (see Figure 6,2). 
Again, as in Section 2.2, we can take covariant derivatives of vector fields defined on M (or 


Figure 6.2. Definition of V)Z 
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portions of it) by using extensions of the vector field to open sets containing M (or the portion 
thereof). From now on, we will simply assume this technical point and the fact that we can take 
covariant derivatives of all the vector fields that arise. 


Exercise 6.2.1. Show that VR a! =a”. Hence, a is a geodesic if Vya’ = 0. Hint: Consider 
da’ /dt as a function of the three variables a', a? and a?. This allows the formula for the chain 
tule to apply to a’[f] = d(f oa)/dt, where f = da’ /dt. Then see Chapter 2. 


Now we write VEE, = wE) + @21€2 + 5,U and VR E> = W12E, + wW272E2 + 52.U, where the 
w’s are functions of the curve w’s parameter t. Using the product rule coordinatewise and the 
definition of the R? covariant derivative, we compute 


0= a’[E, : &] = 2vF'E, : E; = @jI- 


In general, the formula a’[V-W]= VE'V -W+V.- VE Ww holds for vector fields V = 
(v', v?, v’) and W = (w!', w’, w*). This may be seen by the following computation (which 
is reminiscent of the proof of Lemma 2.4.1). 


a'[V-W]=a’ [ye vw] 
a So a'[v']w' +va'[w'] 
= (VE V) wi +v'(VE wy 
=VPv.w+v-VRWwW 
Similarly, since 0 = a’[€; -&] = VRE, -Q+€&- VP’ E>, we have @2; = —@}2. 


Exercise 6.2.2. Show s; = S(a’)-&; and s) = S(a’)- €& where S(q’) is the shape operator ap- 
plied to a’ and sy is the U-coefficient of VE’). 
Finally, we obtain 

VEE, = wn€, + (S(a’)- EU 

Vee = —W2E€) + (S(a’) -E2)U. 


Therefore, remembering that the unadorned notation V refers to the projection of V®’ onto the 
surface’s tangent planes, we have 


t VaiE, = 1&2, VarE2 = —w1€1. 
This description of the covariant derivative V in terms of the coordinate-wise directional deriva- 
tives in R? (i.e., VR’) implies properties such as the following. 
Exercise 6.2.3. Show that the following properties hold for the covariant derivative. 
(1) Sum Rule. Vya+eeZ = fVaZ + gVpZ. 
(2) Leibniz Rule. Va f Z = “2°Z + fVaZ. 
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(3) Commutation Rule. If x(u, v) is an orthogonal patch, then “mixed second partials are 
equal” 


Vx,X0 = Vs, Xu - 
(4) Compatibility Rule. For a tangent vector w, E = x, -x,, F = 0 (say) and G = x, -X,, 
w[E] = 2 VwXu Xu, w[G] = 2 VwXp © Xp. 


These are the types of properties which we can abstract from the situation of a surface M in R?. 
In particular, the last two rules generalize to give a useful covariant derivative associated to the 
so-called Riemannian connection. We shall need these later when we discuss the influence of 
Gaussian curvature on surfaces like the hyperbolic plane. 


6.3 Parallel Vector Fields and Holonomy 


Suppose now that Vp is a unit tangent vector to M at a(t). One of the first questions of classical 
differential geometry is the following. Along the curve a, what tangent vectors (to M) should 
be considered parallel to Vo? After all, the notion of vectors or lines being parallel is basic to 
geometry, so it should not be so surprising that the question of parallelism arises in differential 
geometry as well. Because the covariant derivative V,’V tells us how the (tangent) vector field 
V changes along a, the condition VV = 0 precisely expresses the desired parallelism of all the 
vectors comprising V. In other words, a vanishing covariant derivative implies (as it should) that 
V changes only in the normal direction to the surface and so is unchanging from the viewpoint of 
residents of the surface. Therefore we say that V is a parallel vector field along a if Va'V = 0. 


Exercises 6.3.1. Show that a parallel vector field has constant length. Hint: use the Leibniz rule 
onV-YV. 


Of course we need to know that parallel vector fields actually exist. In fact, we can uniquely 
determine a parallel vector field along a curve from initial data alone. 


Theorem 6.3.2. Let M be a surface with covariant derivative V and suppose a: I > M isa 
curve on M. For any tangent vector to M, Vo at a(0), there exists a parallel vector field V along 
a with Varo) = V. 


Proof. Without loss of generality take Vp to have unit length. Because a parallel vector field must 
have constant length, in fact every vector in the field will have length ] as well. Then we may 
write V = cos6 €; + sin@ € where @ is the angle from V to €;. The desired condition Vx. V = 0 
then becomes (by product and chain rules) 


. dé dé ; 
0 = —sin@ oe E; + cos6 Vy E; + cosé a €2 + sin 8 Vy E>. 


Using our previous calculations of the covariant derivatives of €; and € along a (formulas f), 
we obtain 


do dé 
0 =-—sin@ [on + nar + cos 6 [on + pals 
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do 


Since sin @ and cos @ cannot be zero simultaneously, we must have a; = —@21 OF 


A(t) = 6(0) — : wy) dt. 


This formula then defines @ and, hence, the parallel vector field V. O 


The construction of a parallel vector field V along a from an initial Vp allows us to speak of 
the parallel transport of Vo along a. Namely, the parallel transport of Vo along a to a point a(t) 
is defined (and uniquely determined) to be Vai,). The angle of rotation (which only depends on a 
and @(0)), — f @»1 dt, is called the holonomy along a. 


Exercises 6.3.3. Show that a parallel vector field V along a geodesic a makes a constant angle 
with a’. More generally, suppose V is a parallel vector field along a and W is a vector field 
along a of constant length. Show that W is parallel if and only if the angle between V and W 
is constant. Use this to show that the holonomy around a curve does not depend on the parallel 
vector field along the curve. 


Exercise 6.3.4. Prove the following formula for geodesic curvature (see Theorem 5.4.12). 


dé if 
Ky =—+o 
g dt 21 
Here, @ is the angle between a’ and €,. Hints: (1) write a’ = cos@ €; + sin @ €) just as for V in 


Theorem 6.3.2 and carry out the proof of the theorem to get 
,__ {40 ; 
Vyo' = ae +2, | [—sin@ €; + cos@ €3]. 


(2) Note that o/, = Va’ and compare the formula above to the geodesic curvature formula of 
Theorem 5.4.12. (3) Why must it be true that U x T = —sin6@ €; + cos@ &3? 


Exercise 6.3.4 shows a link between geodesic curvature and holonomy. From Theorem 5.1.5 
we know that geodesic curvature depends only on the metric. Further, an isometry preserves 
angles, so x, and d@/dt are also preserved under isometry. Hence, so is w 2; and we have the 
following result that will allow us to understand holonomy better in Section 6.9. 


Theorem 6.3.5. /sometries preserve holonomy. 
Exercise 6.3.6. Let a: [a,b] — M be a unit speed curve on a surface. Take a frame along the 


curve, {7, V =U x T, U}, where U is the unit normal of M. Show that the natural (“Frenet”) 
equations for this frame are 


Tv = keV +kU 
V= —«,T +t, U 
U' = -kT —T, V 


Here, k is the normal curvature of T on M, kg is the geodesic curvature of a and t, is defined 
by t, = —U’-V = V’-U and is called the geodesic torsion of a. Recall that a curve a is a line 
of curvature of M if T is always an eigenvector of the shape operator of M. Using the natural 
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equations above, show that t, = 0 if and only if is a line of curvature. From this point of view, 
re-do Exercise 5.1.11. 


Exercise 6.3.7. Let U - N = cos@, where N is the principal normal of the unit speed curve a. If 
t denotes the usual torsion of a, show that the geodesic torsion obeys 


dé 
t= Eee . 


Show that, if a is a geodesic, then tr, = —r. Also, show that the converse of this statement does 
not hold by showing that d@/dt = 0 in the equation above for any latitude circle on the sphere. 
Now re-do Exercise 5.1.10. 


Notice what the proof of Theorem 6.3.2 and Exercise 6.3.3 above say in the special case of 
a unit speed u-parameter curve. In this situation, a’ = €), so we have that either the parameter 
curve is a geodesic, in which case 6 = @(0), or the parameter curve is not a geodesic, in which 
case the initial vector Vo is rotated by the holonomy @ (from the viewpoint of 3-space) as it moves 
along the curve. It is now time for the prime example. 


Example 6.3.8 (Holonomy on a Sphere). 
Consider the R-sphere with patch 
x(u, v) = (Rceosucosv, Rsinucosv, Rsinv). 


The parameter curve tangent vectors are x, =(—Rsinucosv, Rcosucosv,0) and x, = 
(—Rcosusinv, —Rsinu sinv, Rcosv), while the unit normal is given by U = (cosucosv, 
sin u cos v, sin v). An orthonormal basis for the tangent plane is then 


€; = (—sinu, cosu, 0), & = (—cosusinv, —sinusinv, cosv). 
Now take a latitude circle on the sphere at latitude vo, 
a(u) = (Rcosu cos vo, R Sinucos vo, R sin vo). 


Note that |a’| = Rcos vp is constant, so the covariant derivatives of €; and € with respect to 
a’ truly describe the changes of €, and € along @. Further, since the parametrization of « is 
compatible with that of €; and € along a and a’ = x,, the R°-covariant derivatives are simply 
derivatives with respect to u, 

VE Ae =(—cosu, —sinu, 0) 
Veer = (sinu sinvp, —cosusinug, 0). 


In order to find Vy€, = w2,€2, we must decompose VRE, = Ww; Eo +5, U (and similarly fo 
VRE, = —wy, €; +52 U). From the formulas above, we obtain 


(—cosu, —sinu, 0) = (—@2; cosu sin vg, —w2) Sin u SiN Vo, Wz} COS Uo) 


+ (S| COS u COS Up, 5] SINU COS Up, S| SIN Up) 
Therefore, we have 


@2; = Sin vo, S| = —COS U9. 
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Figure 6.3. Holonomy around the latitude circle with vy = 7/6 


Exercise 6.3.9, For VE ED = —m2) E; + s2 U, show that sz = 0. 
Var Ey = sin vo &, Ve Eo =—sin Uo E), 


By the proof of the theorem above, the holonomy along a is given by 


pln 


Qn 
- 3, du = -f sin vg du 
0 0 


= —27 sin vp. 


For instance, if v9 = 2/6, then the holonomy is —z. This is verified in Figure 6.3, a picture 
created by the Maple procedure in Section 6.9. 


Exercise* 6.3.10. Calculate the total Gauss curvature over the portion of the R-sphere above 
latitude vp and compare your answer to the holonomy around the vo-latitude circle. Up to the 
addition of a multiple of 277, what do you notice? 


The calculation of holonomy above says that, from our outside viewpoint, parallel tangent 
vectors rotate as they move along a latitude circle. Of course, as the terminology “parallel” 
signifies, 2-dimensional residents of the sphere see the vectors as parallel — so, from their 


viewpoint, not rotating at all. 


Exercise* 6.3.11. What happens at the Equator and why is the Equator special among the circles 
of latitude? 


Exercise 6.3.12. Give the cone 27 = a? (x? + y?) a parametrization 


rad v ‘ ‘ v 
X(u,v)= (u sin(d) cos(*—), nu sin(?p) sin( =), u cos(d)) 
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Figure 6.4. Cone on sphere along a latitude circle 


where ¢ is the vertex angle of the cone, sin(d) = 1/1 + a? and cos(¢) = a/V1 +a. Takea 
parallel 

a(v) = (wo sin(@) cos( 5) , Ug sin(d) sin( =) . ug cos(6)) 
with 0 < v < 27 sin(@) and, by imitating the example above, show that the holonomy around a 
is 27 sin(@). See Section 6.9. Hints: define €&; = x, /VG and & = x, (fora right-handed system). 


Show 2; =—1, so that holonomy is — f;"°"® —1 du = 2x sin(@). 


Exercise 6.3.13. Explain the result of the previous exercise by setting a sphere inside the cone 
so that the latitude v = —@ circle and the cone parallel @ coincide. Show that the unit normals of 
the sphere and the cone (and hence the tangent planes) also coincide along the curve. Therefore, 
the covariant derivatives of the cone and sphere agree along the curve as well. Hint: for the 
cone with vertex at the origin and vertex angle @, the center and radius of the sphere should be 
(0, 0, uy/ cos(P)) and uy sin()/ cos(P) respectively. We give a Maple approach to this exercise 
in Section 6.9. 


6.4 Foucault’s Pendulum 


Nature uses only the longest thread to weave her patterns, so each small piece of fabric 
reveals the organization of the entire tapestry. 


— Richard P. Feynman 


In 1851 Jean Foucault (1819-1868) built a pendulum consisting of a heavy iron ball on a 
wire 200 feet long to demonstrate the rotation of the Earth. (Foucault was also responsible for 
developing means for measuring the speed of light in various media. In particular, he measured 
the speed of light in air and in water and showed that the speed varied inversely with the index of 
refraction (i.e., Snell’s Law).) Foucault observed that the swing-plane of the pendulum precessed, 
or rotated, as time went on, eventually returning to its original direction after a period of T = = 
hours, where vp is the latitude where the Foucault pendulum resides. The usual explanation for 
this phenomenon of precession is in terms of rotating reference frames which produce a horizontal 


6.4. Foucault's Pendulum 285 


Coriolis “force” displacing the swing-plane (see [Sym71] and [Am78]). Here we shall look at 
the Pendulum in the framework of geometric phases or holonomy (see [WS89, Mar92, Opr95]). 

In order to analyze the Foucault pendulum from the viewpoint of geometry, assume the Earth 
to be non-rotating and the pendulum to be situated at latitude vg. Instead of the Earth rotating 
to move the pendulum, we move the pendulum once around the latitude circle in 24 hours at 
constant speed on this stationary Earth. This is clearly equivalent to the standard situation in 
terms of the ultimate movement of the pendulum. The long cable of the pendulum and the slow 
progression around the latitude circle have two consequences (which are given by the usual 
physics arguments). 

First, the long cable provides a relatively small swing for the pendulum which is then approx- 
imately flat. Hence, we may consider each swing as a tangent vector to the sphere. By orienting 
these vectors consistently, we obtain a vector field of pendulum swing plane directions V. At 
each moment of time rf there is such a swing direction vector V(t) and all these vectors may be 
placed along the latitude circle a(u) by associating a given moment of time ¢ with the unique 
point describing the pendulum’s movement along a(u). Hence we write V(u) for the swing plane 
vector field. 

Secondly, because we move around the latitude circle slowly, the consequent centripetal force 
on the pendulum is negligible. (In fact, it is approximately ss of the downward force mg.) 
This says that we may take the only force F felt by the pendulum to be gravity in the normal 
direction U. But such a normal force does not affect the vertical swing plane of the pendulum 
tangentially, so the swing plane appears unchanging to a 2-dimensional resident of the sphere. 
That is, projected to the tangent plane, 

agree ay 
du 
where the covariant derivative again reduces to the ordinary derivative due to our special para- 
metrization. By our earlier discussion, we then have 


Proposition 6.4.1. The vector field V associated to the Foucault pendulum is parallel along a 
latitude circle. 


Of course, as we transport the Foucault pendulum once around the latitude circle a, holonomy 
rotates the parallel vector field V by —2z sin vg radians. In particular, the angular speed of 
this vector rotation is then w = 2 Sin vo fads The equivalence of our geometric situation with the 
physical one then gives 


Theorem 6.4.2. The period of the Foucault pendulum’s precession is 


2 rads - 24 
w® — Sinvg 


Of course, this is precisely the period obtained in physics. Here, however, the precession of 
the swing plane of the Foucault pendulum results from the holonomy along @ induced by the 
curvature of the Earth. Further, since we view the whole pendulum apparatus as stationary relative 
to the Earth, what can explain the observed precession of the swing plane? As Foucault argued, 
we must have 
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Corollary 6.4.3. The Earth rotates along its latitude circles. 


Exercises 6.4.4. Suppose a Foucault pendulum is transported around a latitude circle on a torus. 
(You should still assume the only force is normal to the torus.) Compute the holonomy and 
explain whether this experiment alone can tell you whether we live on a sphere or torus. 


6.5 The Angle Excess Theorem 


Now let us return to geometry itself. Consider a unit speed simple closed curve 6 parametrized 
by 0 < s < 27. Let B denote the region “inside” 6 and assume that B is simply connected. That 
is, we assume that 6 may be continuously shrunk to a point in B. Of course we may use the 
same notation and formulas for B as for a above. Note, however, that we do not assume that our 
surface is in 3-space. We abstract the properties of the covariant derivative from our discussion 
above (see Exercise 6.2.3) and simply note that such a thing actually does exist. For example, 
we write Vg-€| = w21E and @ denotes the angle between f’ and €,. Because €; and € are 
orthonormal, we may. identify w; in two ways: first, 
do 

Vey “Ey = W2| = Kg — de 
by our formula (see Exercise 6.3.4) for geodesic curvature (of 8). Secondly, the left-hand side 
may be computed using the definitions of €; and £2 as x,,//E and x,//G respectively. Note 
that B’ = x,u’ + x,v’, so we have 


Xu du Xu dv 
Ve . — — —_— —|.- 
tease (Gs). awa) i & 


poe du, XwwVE — x,4E7 7 Ey =| Z 
=> ST —_—_—— . 2 


E ds E ds 


= Xuu x,E, | du - Xue x,E, | dv Xy 
“ALVE 2E32] ds | J/E 2E3?} ds} JG 
Using our usual formulas for x,,, and x,, (see Section 3.4), we obtain 


E, G, 
Xi? Xp = ce and xX,» -Xy = os 


Hence, these calculations and the assumed orthogonality of x, and x, imply 
Xu Ky —E, du - G, dv 
VE VG 2VEGds 2J/EG ds’ 


Exercise 6.5.1. Verify the formula above using the expressions for x,, and x,, derived in 
Section 3.4 (and ignoring U components). 


2, = Vg 


Exercise 6.5.2. The general formula for w2, derived above allows the determination of holonomy 
in a variety of situations. For instance, consider the Poincaré plane P = {(x, y) € R? | y > 0} 
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with the patch x(u, v) = (u, v) and conformal metric, 
Ww, W2 


w,ow = 7 
Vv 


where w), W2 € T,(P) and p = (yu, v). 


Show that the holonomy along the horizontal line v = 1, from (0, 1) to (a, 1) is equal to —a. This 
means that a tangent vector at (0, 1) is rotated by —a radians as it moves along v = | to (a, 1). 
Hint: Show that, in general for P, 2; = 1/vp along a u-parameter curve with v = vo. 


For the following we must recall Green’s theorem (Theorem 1.6.2). Let C be a simple closed 
curve which may be continuously shrunken to a point in its interior. We say such a curve is 
shrinkable. The line integral f c P dx + Qdy may be computed as an area integral 


fras+ody=| [(2-=) dx dy 


where R is the region inside C. Now, by the computation of w2; above and the identification 


we obtain 


d@ 
@2| = Kg — as> 


do E, du G, dv 
— =k = SS = =< 
ds 8% 2/EGds 2VEG ds 


We then integrate both sides with respect to s to get 
E, du G, dv 
— ds — fus=[(Se —— — — —~_ =) a 
he f 2JEG ds JEG ds 
E, G, 
= du — dv 
I 2VEG 2VEG 


Fale Gae)* ao (Jee) | 


by Green’s theorem. Then, by Theorem 3.4.1, we have 


= | [ xvBGauav 


-fx 


by the definition of integration on a surface (or patch) and the fact that, since F = 0, |x, x x,| = 


VEG — F2=VEG. 


Theorem 6.5.3. For a shrinkable simple closed curve B, the holonomy around B may be identified 
with the total Gaussian curvature evaluated on the region B inside B. Further, the total change 
in 8 around B is given by 


Qn) —00)= | xas+ f K. 
B 
Corollary 6.5.4. If B is a geodesic, then 


6(2m) — 6(0) = i K 
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Figure 6.5. A triangle in M 


Now, by our definition of 6 as the angle between f’ and €), we see that, for a closed curve f, 
the total change in @ must be an integral multiple of 27. 


Exercises 6.5.5. Explain this statement. 


Hence we have27 -A= of Kgds + f , K,where A isan integer. Now, to see what A is, intuitively 
we might think of what happens as we shrink f to a point. Since the shrinking process is 
continuous and the total turning is an integral multiple of 27, the total turning must remain 
constant throughout. Near a point, we may think of the curve as a tiny circle in the plane and £, 
as a fixed vector (e; = (1, 0) say). In this case it is clear that the tangent vector rotates exactly 
once as it is transported around the circle; that is, the total turning is 27. By continuity of the 
process, we must therefore have A = |. To make this argument precise requires a version of the 
Hopf Umlaufsatz (see [Hsi81] for example). 

Finally, we come to the main goal enunciated at the start of this chapter — an understanding 
of a fundamental concept in traditional geometry from the differential point of view. Con- 
sider a triangle (Figure 6.5) which is made up of three curves in a surface M and which is 
shrinkable. We say the angles i; are interior angles and their supplements m — i; are exterior 
angles of the triangle. Suppose we move around the triangle in the direction shown and con- 
sider the turning of the tangent vector(s) to the curve(s). The same intuitive argument above 
shows that we obtain a total turning of 27. There are two contributions to turning: (1) the con- 
tribution obtained as above via the geodesic curvature formula, f «,(A) + » K, and (2) the 
contributions obtained at the corners of the triangle, (7 — i,;) + (7 — i2) + (4 — i3). Hence, we 
have 


an = fag) + f K+ 08 ~in) +8 ~ ia) + ( ~i) 
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Theorem 6.5.6. /f a triangle in a surface is shrinkable to a point, then 
3 
2x = faa f K+) (x -ij). 
A : 
j=l 


Moving the angle sum to the other side and multiplying by —1 produces 
- f egay- f K =-2n4+(n -i))+(@ -—)+ (2-4) 
A 
=m — (i) +12 +13). 


Corollary 6.5.7. Ifa triangle A in a surface is shrinkable to a point and is made up of geodesic 
segments, then the sum of the interior angles of the triangle differs from n by (+ or —) the total 
Gaussian curvature. 


Corollary 6.5.8. Jf the surface M has constant Gauss curvature K and A is as in Corollary 6.5.7, 
then nm — (i, + i2 + i3) = —K - Area(A). 


Example 6.5.9 (Angle Excess). We have the following examples of angle excess. 
The R-sphere. K = on so 


=— dA= 


i 1 Area of A 
A R? Ji Re 


Hence Doi; —- 27 = ee Therefore, on a sphere, 


: Area of A 
SS > 


which verifies the traditional “parallel postulate” approach. 


Exercises 6.5.10. Show that the sum of the angles of a triangle in the hyperbolic plane H or 
Poincaré plane P is strictly less than 7. 


Thus, curvature affects even the most fundamental of geometric quantities, the sum of the 
angles of a triangle. In this sense, from the differential point of view, Gaussian curvature is the 
basic structural element of geometry. In the next section we shall consider a global version of 
Theorem 6.5.6 -—- the famous Gauss-Bonnet Theorem. 


6.6 The Gauss-Bonnet Theorem 


It is a fact that every surface M may be triangulated. That is, the surface may be completely 
covered by shrinkable triangles in the surface which meet only along edges or at vertices. 
Furthermore, an orientation on a surface induces orientations on each of the triangles so that the 
edge orientations are opposite when considered in adjacent triangles (see Figure 6.6). 

Suppose M has a triangulation with V total vertices, E total edges and F total triangles. 
Theorem 6.5.6 applies to each triangle, so we can add up both sides of the formula over all 
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Figure 6.6. Oriented triangles meeting along an edge 


triangles to get 


3 
> f eta Sy K+>° > @ -ij) =) 622. 

A a vA A j=l A 
Now, M may have boundary curves (which we will assume are smooth) just as for a cylinder 
of finite height. We denote this boundary by aM. In this case, we may split up the sets of vertices 
and edges according to whether they lie on the boundary or in the interior of M. Let V = V; + Vg 
and E = E; + Eg, where the subscripts / and B refer to the interior and boundary respectively. 

Then 


/ c+ | K+3n F-—2n V,;-—x Vg =2n F 
aM M 


since the sum of the interior angles at a particular vertex in V,, for all triangles intersecting that 
vertex, 1s 27 and the sum of the interior angles at a particular vertex in Vg, for all triangles inter- 
secting that vertex, is the angle of the tangent line to the boundary at that vertex, 2. Furthermore, 
each triangle has three edges surrounding it and two triangles share a common edge, except for 
edges in Eg which meet only one triangle. This leads to the equality 3F = 2E, + Eg. Replacing 
3F qr by (2E; + Eg), we then have 


/ ket f K420 Ey +m Eg 2a Vp Vp = Dn F 
aM M 


/ ket | K42m Ey tn Eg tn Es au Vp —2n V,-n Vp =2nF 
aM M 


since Vg = Eg. Then 
/ ket fl K+ 20 (Ey + Ep) — 20 (Vs + Va) = 20 F 
aM M 


f c+ | K+2nE—2nV=2n F 
aM M 


/ c+ f K=2n(V-—-E+F) 
aM M 
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Therefore, if we define x(M) = V — E+ F to be the Euler Characteristic of M, then we 
have 


Theorem 6.6.1 (Gauss-Bonnet Theorem). [fM is a compact oriented surface with boundary 0M 
made up of a finite number of smooth closed curves, then 


| e+ f K = 20 x00). 
aM M 


Corollary 6.6.2. If M is a compact oriented surface without boundary, then 
/ K =2n x(M). 
M 


Of course, the one point we have neglected up to this point is that the quantity x(M) = 
V — E4F may not be characteristic of M, but may depend on which triangulation is cho- 
sen. That this is not the case is a result which goes back to Euler. In fact, the result is more 
general. A graph G consists of finite sets of vertices V = {vz} and edges E = {e;;} where 
ej; denotes the unique edge joining v; and v;. The graph is connected if there is an edge 
path joining any two vertices. We shall only consider connected graphs. A graph G is em- 
bedded in a surface M if there is a map f: G — M such that the images of edges never 
intersect except at vertex images. An embedding is a 2-cell embedding if the surface polygons 
(or faces) determined by the embedding are simply connected (and, hence, look like deformed 
2-disks). 


Exercise 6.6.3. Embed a small graph in the plane and calculate V — E + F. Be sure to include 
the face at infinity, the face which surrounds the entire graph (which can be considered a 2-cell 
on the sphere). Do the same for 2-cell embeddings on the sphere, torus and cylinder. Be sure your 
embeddings are 2-cell. 


Theorem 6.6.4 (Euler). Any embedding of a graph in the plane gives V — E + F =2. 


Proof. The proof is by induction on the number of edges in the graph. The reader can easily 
check that the result is true for graphs with one or two edges. Let G be a graph with E edges. 
There are two cases. The graph G may be a tree; a graph enclosing no faces. Then E = V — 1 
and F = | (the face at infinity), so V - E+ F =V—-—(V—1)+1 = 2as desired. If G is nota 
tree, then G has a cycle, a path of edges which form a closed curve in the plane. This cycle has 
an inside and outside by the Jordan curve theorem, so there is an edge e which bounds two faces. 
Remove this edge from G. Note the new graph G is still connected since e was on a cycle and the 
number of faces has been reduced to F — 1. By the inductive hypothesis on G (since it has E — 1 
edges), V —(E — 1)+(F — 1) =2. But simplifying the left-hand side gives V- E+ F =2 
which applies to the original graph G. O 


Exercise 6.6.5. Using the proof above as a model, show that the Euler characteristics x = 
V — E+ F for the sphere, torus, disk and cylinder are, respectively, 2, 0, 1 and 0. For the disk 
(i.e., a circle together with its interior), use the theorem for the plane and remove the face at 
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Figure 6.7. A pretzel 


infinity. Explain why the sphere and plane both give V — E + F = 2 for 2-cell embeddings, 
Hint: stereographic projection. 


If we disregard geometry and only consider shape up to continuous deformation (for example), 
then any compact oriented surface (without boundary) is known to assume the form of a sphere 
with handles. A sphere with g handles, S,, is obtained by cutting 2g disks out of the sphere and 
gluing in g cylinders along the boundary circles. Of course, the sphere is simply So and the torus 
is Sy. 


Exercise 6.6.6. Show that any 2-cell embedding of a graph on S, gives V - E+ F =2- 
2g. Hence, the Euler characteristic of a compact oriented surface without boundary is always 
of the form 2 — 2g, Indeed, for a compact oriented surface (without boundary) M. the Euler 
characteristic characterizes the surface M up to homeomorphism. 


Exercise+ 6.6.7. We have seen in Exercise 5.4.9 that a torus may be given a metric so that 
the Gauss curvature K = 0 at every point. Show, using Gauss-Bonnet, that the torus can never 
be given a metric with K <0 globally and K <0 at some point. This indicates how a non- 
geometric (but deformation) invariant such as the Euler characteristic can constrain the geometry 
of a surface. Further. show that any surface with positive Gauss curvature is deformable to the 
sphere. In fact, such a surface must be “topologically equivalent” to the sphere. 


Consider what the Gauss-Bonnet Theorem is really saying. For instance, a sphere may be 
stretched and twisted (Without ripping) to produce various geometrically distinct surfaces such 
as the “pretzel” of Figure 6.7, 

But, while the Gauss curvatures of these surfaces are clearly not constant, because these 
surfaces are deformed spheres, the Gauss-Bonnet Theorem says that the fota/ Gauss curvatures 
are always constrained to be 42. Somehow, the deformation shape of the surface — which is 
reflected in the global invariant x — acts to determine a global geomerric invariant, the total 
Gaussian curvature. The idea of constraining geometry by such deformation invariants has led to 
many major results in Mathematics, In this way, the Gauss-Bonnet Theorem is the ancestor of 
much of the spirit of modern differential geometry and algebraic topology, (See [Got96] for an 
interesting perspective on Gauss-Bonnet and the Euler characteristic.) 


6.7 Applications of Gauss-Bonnet 


There are other ways in which Gauss-Bonnet constrains the geometry of a surface. The following 
example indicates how geodesics may be affected. 
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Figure 6.8. A possible closed geodesic? 


Example 6.7.1 (Closed Geodesics on the Hyperboloid of One Sheet). 
Consider M: x* + y* — 2? = 1, the hyperboloid of one sheet. Recall that M is a surface of 
revolution parametrized by 


x(u, v) = (coshu cosv, coshu sinv, sinhu). 


The central (v-parameter) circle a = x(0, v) = (cos v, sin v, 0) is a (closed) geodesic since h(u) = 
cosh u and h’(0) = sinh(0) = 0. In fact, using the Gauss-Bonnet Theorem, we can show that this 
circle is the only closed geodesic on M. To see this, suppose that 6 is another closed geodesic 
on M. There are two possibilities; either a and are disjoint or they intersect. Suppose they are 
disjoint. Then a and f are the smooth boundary curves of a portion of the hyperboloid M which 
can be deformed to a cylinder and which therefore, by the exercises above, has Euler characteristic 
zero. Now, as part of the hyperboloid, M has K < 0 at each point. Hence, Sa K < 0 contradicts 
the Gauss-Bonnet formula a K = 2mx(M) = 0. (Here we have used the fact that a and f are 
geodesics to obtain f,7, ky = 0.) 

Now suppose that @ and £ intersect in (necessarily) two points making interior angles ¢;, both 
of which are less than zr. 


Exercise 6.7.2. Justify this last statement, @; < 7. Hint: use the uniqueness of geodesics through 
a point with given initial tangent vector. 


The region R (see Figure 6.8) bounded by a@ and 6 with interior angles ¢; is simply connected 
and, so, is deformable to a disk. Indeed, we may consider the region to be a triangle with one 
vertex on f (say) with angle 7 and the other two vertices to be the points of intersection with 
interior angles ¢;. Again, since R is part of the hyperboloid, /, pK < 0. The local Gauss-Bonnet 
theorem then says 


[ k+0r-m 4-04-62) = 28 
R 


[k=640 
R 


[x>o 
R 
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which is a contradiction. Hence, there are no other closed geodesics. 


Exercises 6.7.3. Show that, on a surface with K <0, no closed geodesic bounds a simply 
connected region (i.e., a region deformable to a disk). 


In order to understand the next application of Gauss-Bonnet, we need some background. We 
mainly follow the approach of [So196]. Suppose @(s) is a unit speed curve which lies on the unit 
sphere S? C R?. Let the sphere’s unit normal be denoted by U. As in Section 5.1, we can write 


a” =K,U x T +(e" -U)U 
=k,UxT—U, 


since, on the R-sphere, the shape operator has the form S(@’) = —a’/R (see Example 2.2.13) and 


a” -U =a’- S(a’) by Lemma2.4.1 
_pr/ 
ee ln 


R 


=-] sincea’-a’ = landR=l1. 


Note that, by Exercise 5.1.1 and our convention that the curvature of a curve is positive, this 
computation also says that ky > 1 along all of a. Also, observe that, since @ lies on the unit 
sphere, we have a(s) = Us) for all s. Thus, we can write a” =k,a x T—a. 


Exercise 6.7.4. Show that (U x T)' = —x,T with notation as above. Hint: use the Leibniz rule 
for differentiation, properties of cross products and the formula a” = kya x T — a. 


Now, a’(s) = T(s) is also a curve on the unit sphere S? and it is called the tantrix of a. The 
word “tantrix” was coined in [Sol96] as a contraction of the classical term tangent indicatrix . 
Note that the tantrix T(s) is not necessarily unit speed since |T’(s)| = kg by definition. Let t be 
the arclength parameter of the tantrix T(s); 


ae dt , 
rs)= | IT (u)| du, A Sates) 
0 AY 


Because dt/ds > 0 always, we know that there is an inverse function s = s(t) with 


ds 1 1 


dt dt/ds ky 


We then can take the derivative of a with respect to t and get 


da dads ds 1 
ee ea Tp 
dt ds dt dt Ko 
Define a vector field along T(t) by Vi, = a(t) and notice that Vry)- T(t) = a(t): TH) = 
Ua» T(t) = Una) - a(t) = 0. Hence, a(t) is a vector field along T(t) which is tangent to the 
sphere. In fact, w(t) is a parallel vector field along T(t). To see this, note that 
da 1 


1 
—=--—T=—U 
dt Ka Ka 7o 
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since T(t) is a curve on S*. Hence, da:/dt is in the normal direction with zero projection onto the 
tangent plane. This means that @ is parallel along T(t). 

Now let’s carry out the same sort of argument as in Theorem 6.3.2, but in the present situation 
for a tangent vector field along a curve f on S? of the form V = cos(@) T + sin(@) U x T (where 
@ is the angle between V and T). Here we use the fact that parallel vector fields have constant 
length (which we scale to be 1). From the calculations above, we have 


PS 4.0 XT HU; (U xT) =-«,T, 
where «, denotes the geodesic curvature of 8. We then obtain for the derivative of V; 
V’ = —sin(d) d’ T + cos(¢) T’ + cos(o) ¢’ U x T + sin(d)(U x TY 
= —(¢’ + k,)sin(¢) T + (¢' + k,) cos(@)(U x T) — cos(¢) U. 
Of course, V is parallel when the projection of V’ onto the tangent plane is zero. Because sin(?) 


and cos(#) cannot be zero simultaneously, V is parallel if and only if ¢’ = —kg. 


Theorem 6.7.5. Ifc(s) is a unit speed closed curve on S? with (closed) tantrix T(s), then the total 
geodesic curvature of T(s) is zero. Furthermore, if T(s) is simple (i.e., without self-intersections), 
then T(s) encloses a region of area 2x. 


Proof. Regard a(t) as a parallel vector field along T(t) as in the discussion above. Let L be the 
period of a(t) (i.e., a(0) = a(L)) and, hence, T. The geodesic curvature of T(t) then satisfies 
Kg = —¢’ and the total geodesic curvature is 


[=-[ #ar=0@-60)=0 


since a is closed. Now, if T is simple, then the Jordan curve theorem says that the region R 
enclosed by T is a disk. The Euler characteristic is then seen to be x(R) = 1 by Exercise 6.6.5. 
The Gauss-Bonnet theorem, He 7 Kg = 0 and the fact that S 2 has constant Gauss curvature K = 1 


then give 
feet ff K dA=2z x(R) 
T R 


0+ ff iaa=20 
R 


Area(R) = 27. 
im 


Theorem 6.7.5 has as a consequence a famous and beautiful theorem of Jacobi concerning 
closed curves in R?. 


Corollary 6.7.6 (Jacobi’s Bisection Theorem). Let a(s) be a unit speed closed curve in R? with 
k > 0 everywhere along a. Let N(s) be the principal normal of a thought of as a closed curve 
on S*. If N(s) is a simple curve, then N(s) encloses a region of area 2x. In other words, N(s) 
bisects the area of S*. 
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Proof. Let T(s) denote the tantrix of a and reparametrize it by T(t) to have unit speed. Then 7 (t) 
is a closed unit speed curve on S* with its own tantrix 
dT dT ds ds | 
T(t) = — = — — =& N(s) — = ky N(s) — = N(s). 
ede de Od 
Because the tantrix of T is the principal normal N of a, the result follows from 
Theorem 6.7.5. OU 


As our final application of the Gauss-Bonnet theorem, we present a theorem of Hadamard 
which mixes a curvature condition with certain topological characteristics to produce a powerful 
geometric consequence. We follow the exposition in [Che67]. We say that a surface is convex 
if it lies on one side of each of its tangent planes. A surface is closed if it is compact without 
boundary. 


Theorem 6.7.7 (Hadamard’s Theorem). Jf M is a closed orientable surface in R? with K > 0 
everywhere, then M is convex. 


Proof. The Gauss-Bonnet theorem says that 0 < dy K dA = 27 x(M). Thus, x(M) > 0. By 
Exercise 6.6.6, this means that y(M) = 2 and tia KdA=4n. 

Now consider the Gauss map G: M — S? and recall that the negative of the shape operator 
—S, may be identified with the derivative mapping G,. at p (see Section 2.3). The Gauss curvature 
is given by the determinant of S,, so our hypothesis K > 0 says that det(S,) > 0 forall p € M. 
This implies that G, 1s an invertible linear transformation for all p. By Theorem 2.3.8, there exists 
an open neighborhood ©, about p such that G: O, > G(O,) C S? is one-to-one onto the open 
set G(O,) with a smooth inverse function. (In other words, O, and G(Q,) are diffeomorphic via 
G.) This is true for every p € M, so G(M) is an open set in S. (That is, every point of G(M) is 
contained in an open set of S* consisting entirely of other points in G(M).) But M is compact, 
so, by Proposition 2.1.4, G(M) is also compact and, therefore closed. Hence, G(M) is both open 
and closed in the connected set S?. By Exercise 2.1.2, G(M) = S?. 

We now want to show that G: M — S? is not only onto, but one-to-one as well. Suppose 
not. Then there are x, y in M with G(x) = G(y). Let O, and O, be two neighborhoods given 
by the Inverse Function Theorem as above. We may assume they are disjoint by shrinking 
them appropriately. Of course, since G(x) = G(y), we have a non-empty intersection V = 
G(O,)N G(O,). Let W = Gig\(V) and note that G(M — W) = S? still since G(G|g)(V)) = V, 
By Proposition 3.1.8, we have | 


i K dA = Area(G(M — W)) > 4x 
M-—W 


since G(M — W) = S? and G(M — W) may multiply-cover S*. But, since K > 0 on W CM, 
we have fy, K dA > 0 and 


[ xaa=[ Kaas f KdA>4nx 
M Ww MW 


which contradicts our earlier calculation /, u K dA =4n. Thus, G is globally one-to-one as 
well as being onto. But this means that G: M —> S* is a diffeomorphism because, by our 
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earlier application of the Inverse Function Theorem, the local inverses (which must then be the 
restrictions of the global inverse function) G~! are smooth. 

Finally, let’s see how G: M — S? being one-to-one and onto implies the convexity of M. 
Because G(p) = U(p), where U is the unit normal of M, the diffeomorphism G: M —> S? says 
that, for each direction v in R’, there is precisely one point p in M at which U(p) points in 
that direction v. So, choose a direction v € S? and find the unique point p € M with U(p) =v. 
Take the tangent plane 7,(M) (situated at p) and move it in the direction v far from M. Define 
a function f: M —> R by f(q) = distance of q from T,(M) in the v direction. Then, f is 
continuous and M is compact, so by Proposition 2.1.4, f has a minimum m on M. Move T,(M) 
back in the —v direction and the first point of M it hits must then be m. It can be seen that we 
must then have U(m) = v = U(p). But this contradicts the uniqueness of normal vectors unless 
m = p. So, since p is the minimum distance point from the moved T,(M), when we move T,(M) 
back to p, all other points of M lie on the far side of T,(M). Since this is true for every p, M is 
convex. O 


6.8 Geodesic Polar Coordinates 


Finally, let us look at an alternative framework for the angle-sum theorem Corollary 6.5.8. 
Although we have mentioned geodesic polar coordinates previously in Chapter 5, we have not 
described their construction. We shall do this now and see how they allow a conceptually different 
approach to the angle-sum theorem as well as yet another example of the influence of curvature on 
global geometry. A full (and rigorous) discussion of geodesic polar coordinates is rather technical 
and we will not go into it here. A nice (and relatively elementary) discussion may be found in 
[O’N66] and much of what we do below mimics this. Many of the details below are left to the 
reader. 

Fix a point p € M and let x(u, v) denote a mapping from a portion of the plane obtained by 
using v as the angle ofa unit tangent vector w at p from a fixed unit tangent vector e; and u as the 
“distance” along the unique unit speed geodesic a@ with a(0) = p, a’(0) = w. More specifically, 
let e; and e) be perpendicular unit tangent vectors at p and write w = cosve; + sinv e). Then 
X(U, UV) = Gy(u), where the subscript is to remind us that @ is determined by w, which, in turn, is 
determined by the parameter v. Of course we must restrict u so that x(u, v) is defined, so x(u, v) 
maps onto a (generally small) neighborhood of p. The key result, which follows from the inverse 
function theorem and which we shall not prove, is 


Theorem 6.8.1. The mapping x(u, v) is a patch for u > 0. That is, x(u, v) is a smooth map 
with smooth inverse from an open set in the plane onto an open neighborhood of p € M and is 
one-to-one for u > 0. 


Note the exclusion of p from the parametrization. This exclusion follows for the same reason 
that ordinary polar coordinates exclude the origin in the plane. As we have mentioned, such a 
patch is called a geodesic polar coordinate patch. 


Exercise 6.8.2. Using the fact that straight lines are geodesics in the plane (with the usual 
Euclidean metric), show that geodesic polar coordinates about the origin in the plane are, in fact, 
ordinary polar coordinates. 
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Figure 6.9. A geodesic polar patch 


Exercise 6.8.3, Let e; = (1,0), @2 = (0. 1) and switch the roles of u and v in the usual patch for 
the unit sphere. Show that this patch provides geodesic polar coordinates around the North pole 
N = x(z7/2. v). Here, note that it isn’t necessary to begin the geodesic parametrization at f = 0. 
How much of the sphere does this patch cover? 


Exercise 6.8.4. The usual polar patch for the hyperbolic plane x(u, v) = (uv cos uv, u sinv), does 
not give unit speed radial lines through (0, 0). Show that the arclength of these lines is given by 
s(u) = 2 arctanh(w/2) and that a unit speed reparametrization is then given by 


B(s) = (2 tanh > cos vy, 2 tanh = sin v0) 


for fixed Uy. Remember to use the hyperbolic metric here! Now reparametrize the patch x and 
show that you get geodesic polar coordinates around zero. 


The key to using geodesic polar coordinates is the following 
Lemma 6.8.5 (Gauss’s Lemma), The patch x(u, v) has E = 1, F =O andG > 0. 


Proof. E =x, + %, = | because u-parameter curves ay, (u) are unit speed. Hence, x,[E] = Ey = 
0. The product rule, however, gives (using the commutation rule, Exercise 6.2.3) 


0=£, 
= V5, (Xy + Xu) 
= 2 Vy Ry “Ry 
= 2 Ve, Xy > Xx 


Now, since u-parameter curves are geodesics with Vy x, = 0, we obtain 
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Therefore, F only depends on v. But now consider the v-parameter curve x(uo, v) and note that, 
at u = 0, x(0, v) = p is constant. Therefore, at p, x,(0, v) = 0 for every v and (for fixed v) 

lim F = lim x,(uo, v)- X,(uo, v) = 0. 

ug—0 ug—>0 
Since F doesn’t depend on u, F = 0 along this curve with v = constant. But the same argument 
applies for every v, so we must have F = 0 identically in the patch. Thus, the u-parameter curves 
are radial geodesics (from p) and the v-parameter curves are the orthogonal trajectories to these 
geodesics. Finally, because x is a patch, G = EG — F? > 0. O 


Remark 6.8.6. Note that the proof above contains the fact that /G(0, v) = |x,(0, v)| = 0. We 
will use this below as an initial condition for a very special differential equation. 


Exercise 6.8.7. Use the argument of Chapter 5 to show that, for any point g in the geodesic 
polar patch, the radial geodesic from p (1.e., the u-parameter curve) gives the shortest arclength 
of any curve joining p to q. Further, from the argument, infer that no other curve gives this 
arclength. 


Since the point p (called the pole) is a trouble spot for the patch, we must be careful in making 
inferences about the geometry near it. In order to understand what happens there, as is our custom 
now, we first construct a frame from the geodesic polar patch, 


Xy 
VG 
The u-parameter curves are geodesics, so €; is parallel along them. The vector field €2, however, 


has constant length | and maintains a constant angle of 7/2 with €, so (by Exercise 6.3.3) is 
parallel along radial geodesics as well. That is, Vx,& = 0. Let 


& =X, & = 


E\(0) = cos up €] + sin vp €2 &,(0) = — sin vp e; + cos v9 2 


where e;, e2 are fixed unit tangent vectors at p. Let a(u) = x(u, vo) and consider the following 
covariant derivative along this u-parameter curve, 


Vx,Xy = Vx,(VG &) 
= (VG), £2 + VG V5, 
But €) is parallel along a, so Vx,&, = 0. Thus, 
Vx, Xu = Vx, Xv 
= (VG)u E2. 
In particular, as we approach p, 


X,y(0, Vo) = lim(VG)u(u, v9))€2(0). 


We can also compute this quantity another way using x, = €). We have €)(0) = cos vg e; + 
sin vg €2 = x,(0, v9) and we may write x,(0,v) =cosve; +sinuve). Then we also can 
compute V,,x,(0, v) = — sinve; + cos vez and V,x,x,(0, vo) = — sin v9 e + Cos v9 €2 = E(0). 
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Therefore, 
lim(VG),(u, v0))E2(0) = E2(0) 
and 
lim(VG),(u, v0) = 1. 


Notice what this calculation says about the metric near p. We knew previously, of course, that 
E = 1 and F = 0 everywhere on the patch, so the fact that G is close to u* near p says that 
the metric around p is close to being Euclidean (in polar coordinates)— at least to first-order 
derivatives. This indicates also that curvature should be given by a second derivative of /G. 
This is verified in the important exercise below. 


Exercise 6.8.8. Show that K = —(1//G)(/G),, by writing out the chain-rule formula for 
(/G),,, and comparing the result to that obtained by plugging E = 1 into Theorem 3.4.1. A little 
algebra produces the Jacobi Equation along a radial geodesic, (WG), + K /G = 0. 


In order to understand the angle-excess theorem from the viewpoint of geodesic polar co- 
ordinates, we must deal with non-radial geodesics as well as radial ones to form a trian- 
gle. With this in mind, let y be a unit speed geodesic which intersects a radial geodesic 
a(u) = x(u, vo). We may write y(t) = x(u(t), v(t)) and note that y’ =x, “ +x, ay with 
(du/dty + G(dv/dt) = 1 since y is unit speed. Also, the angle ¢ at which y and aq inter- 
sect is found by taking the inner product (denoted o) of their tangent vectors, 


d 
cosg¢ =x, 0p’ = oa 


Hence we obtain G(dv/dt? = 1 — (du/dt) = sin? ¢ and so sing = /G dv/dt. Now we dif- 
ferentiate cos@ = x, o y’ along y to get 
d 
—sing < = Vyx, oy’ +x, 0 Vyy’ 
= Vyx,0y' 


/ 
= Vy, dx, eXu OY 


_ du dv 
ar —V,,X%, OY + = VaXu oy’ 


_ dv 


ery — Vx, Xu ° y 


dv ; 
=, 57 (YOu En oy 


: (2) wa VE. 
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B 


Figure 6.10. A geodesic triangle 


Then, replacing sing with /G & produces 


dp dv 
dt (VG. dt 
do = (VG), dv 


in differential form. The calculation above relates the angle of intersection of geodesics to the 
rate of change of JG (along a radial geodesic). We can therefore apply this to a triangle made up 
of two radial geodesics and a non-radial geodesic which intersects them both to give a different 
proof of the angle-excess theorem (Corollary 6.5.8). 


Exercise 6.8.9 (Following [Cox69]). Suppose a geodesic triangle in M is made up of two radial 


geodesics a(u) = x(u, vo), B(u) = x(u, v;) and a unit speed geodesic y(t) = x(u(r), u(t)) which 
intersects a and 6 as shown in Figure 6.10. Prove that 


[x=atete-n 
A 


Hints: (1) Use geodesic polar coordinates. (2) Start with the formula for total Gauss curvature 
and justify the calculation below, 


[x=] [xvec— Pavan 


=f [ kvGduaw 


=f [ -Grududv 
= [1-WGudv 


Uy Cc 
= / dv+ | do 
Uo x-B 


=A+B+C-—n. 
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Now, from the start, we constructed a geodesic polar coordinate patch only in a neighborhood 
of a point p € M. It may (and does) happen that, if we try to extend the patch beyond this 
neighborhood — even to its boundary — then we lose the qualities of a patch. Namely, we 
may lose one-to-oneness or we may find that there is a point x(uo, v) with G(uo, v) = 0 so that 
regularity of the patch fails. This is a situation we can analyze and through which, once again, 
we will see the powerful effect of curvature on geometry. 

Let x(u, v) be a geodesic polar patch on M with pole p. Say that p = x(uo, v) is conjugate 
to p along the (radial) geodesic determined by v if G(uo, v) = 0. Of course, we can take 
geodesic polar coordinates about any point p € M, so we can get away from taking a specific 
parametrization when we need to and simply consider conjugate points along arbitrary geodesics. 
The following theorem is the key to understanding the relationships among geodesics, conjugate 
points and shortest arclengths. The same argument as in Exercise 6.8.9 above (as well as Chapter 5) 
essentially proves the first part. The second part is more technical and harder, but plausible from 
the discussion above. The third part is the Hopf-Rinow theorem which we mentioned in Chapter 5. 
(In the following, we say that a curve £ is close to a if the distance from a(t) to B(t) never exceeds 
a certain small fixed amount.) 


Theorem 6.8.10. The following “shortest distance” results hold. 


(1) Ifa is a geodesic joining p € M tog € M and there are no conjugate points to p along a 
between p and q, then a gives the shortest arclength of any curve which is close to a and 
which joins p and q. 


(2) If p = ato) is conjugate to p = a(0) along the geodesic a, then a cannot give the shortest 
arclength of any curve (even close to a) joining p to any q = a(t)) forall t; > to. 


(3) If M is geodesically complete, then any two points of M may be joined by a geodesic which 
has the shortest arclength of any curve between the two points. 


Example 6.8.11 (Shortest Length Curves on the Sphere). 

The parametrization of the unit sphere obtained by switching the roles of u and v in the usual 
geographic parametrization gives G = cos? u. The North pole, u = 2/2, gives G = 0 and the 
next such zero occurs at the conjugate point u = —z/2, the South pole S. Of course a longitude 
(i.e., u-parameter curve) is a geodesic which gives the shortest arclength of any curve joining N 
and 5, but once we go past S on the great circle to a point g, then the geodesic giving the shortest 
arclength from N to q is the longitude on the other side of the sphere. 


Conjugate points may be determined from curvature in the following way. We have seen in 
Exercise 6.8.8 that /G along a geodesic obeys the following Jacobi differential equation with 
initial conditions: 


(VG) + K VG =0 
subject to 


/G(0, v) =0 
(VG),(0, v) = 1 
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Figure 6.11. A non-minimum length geodesic 


for all v. But the solution to such an equation is uniquely determined by the usual uniqueness 
theorems of differential equations theory. Therefore, in order to find a zero of G (i.e., a conjugate 
point of p) along a geodesic, it is sufficient to solve the differential equation abstractly and then 
find a zero of the abstract solution. Specifically, solve the differential equation (which we also 
call the Jacobi equation) 


Fae K f =0 
subject to 
f(0) =0 
f'0)=1 


for a function f and then find ug with f(uo) = 0. 


Example 6.8.12 (Jacobi Equation on the Sphere). 

The unit sphere, with the parametrization of Exercise 6.8.3 (and Example 6.8.11), has K = 1 
and initial conditions f(7/2) = 0, f’(/2) = 1. The general solution of f” + f =0 is given 
by f(u) = A cosu + B sinu and the initial conditions give B = 0 and A = —1. Then, f(u) = 
— cos u and the next zero for f afteru = 2/2 is up) = —m/2 — the South pole. Note that, because 
radial geodesics from the North pole WN are unit speed, the arclength to the South pole is 


—x/2 
/ ja’| dt 
n/2 


L(@)= 


Exercise 6.8.13. Consider the following problems involving the Jacobi equation. 


(1) For a plane, where and when do geodesics give the shortest arclength? Find conjugate 
points by solving the Jacobi equation with K = 0. 
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(2) For the hyperbolic plane, where and when do geodesics give the shortest arclength? Find 
conjugate points by solving the Jacobi equation with K = —1. 


(3) Generalize the first two parts to the following: if a surface M has K < 0, then there are 
no conjugate points along any geodesic. Hence, any geodesic gives the shortest arclength 
for all curves near it. Hints: show that f(u) > u by using the initial conditions f(0) = 0, 
f'(0) = 1 and the Jacobi equation to derive the consequence f’(u) > 1. Hence, f(u) 4 0, 
so no conjugate point can occur. 


Finally, we come to the main result of this section. In order to give a proof, we first need a 
basic comparison result from differential equations. (Also, we use the fact that, just as in the 
sphere example above, because geodesics are unit speed, the parameter u along the geodesic is 
precisely the arclength along the geodesic.) 


Theorem 6.8.14 (Sturm-Liouville). Let y”(t) + g(t) y(t) = 0 be a second-order linear differen- 
tial equation with initial conditions y(0) = 0 and y'(0) 4 0. Suppose, for all t, thata? < g(t) < Bb’ 
and let t) > 0 be the first positive solution to y(t) = 0. Then 


Xu 
S05 


ae | 


rs . 
We can apply the Sturm-Liouville comparison theorem to the Jacobi equation in the situation 


where Gauss curvature is positive and bounded away from zero. (In order to ensure the proper 
domains for geodesics, we again refer to the notion of “completeness” mentioned in Chapter 5.) 


Lemma 6.8.15. Suppose M is complete and K > a* > 0. Then any geodesic a: [0,00) > M 
has a conjugate point somewhere in the interval (0, m/a). 


Proof. We apply the Sturm-Liouville theorem to the Jacobi equation f” + K f = 0 to see that 
the first conjugate point (1.e., first zero of f) occurs for ug < m/a. O 


We can globalize this local result by asking how big M itself can be under the curvature 
constraint K > a* > 0. From Theorem 6.8.10 above, if M is complete, then any two points 
p.q@ € M are joined by a shortest length geodesic. Thus, it makes sense to define the diameter 
of M to be the length of the longest shortest-length geodesic joining any two points of M. In 
fact, this length may not be attained, so we must take a supremum of such lengths. Formally, we 
define the distance of p from q, d(p, q), to be the arclength of the shortest geodesic joining p 
and qg. Then 


Diam(M) © sup{d(p, q)| p andg € M}. 


Note that a finite diameter for M C R? implies that M is bounded. Recall that a closed bounded 
subset of R? is called compact. The notion of compactness is, in fact, more general, but if it is 
unfamiliar, then the reader should simply consider the R? situation. Lemma 6.8.15 then gives 


Theorem 6.8.16 (Bonnet). Suppose M is complete and K > a* > 0. Then Diam(M) < z/a and, 
consequently, M is compact. 
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Proof. Let p.q € M be arbitrary points. Since M is complete, there is a shortest arclength 
geodesic joining them. Lemma 6.8.15 implies that the first conjugate point occurs along this 
geodesic at up < 2/a. Since the geodesic is shortest length and we know this cannot be the case 
after the first conjugate point, then the arclength of the geodesic must be < 2/a. Because this is 
true for all p,g € M, then Diam(M) < z/a. Hence, M is bounded and, because it is complete, 
it is therefore compact. O 


Once again we see that hypotheses on Gauss curvature have powerful consequences. We 
saw in Chapter 3 that compactness implies a local result: namely, that the curvature is positive 
for at least one point of a surface. Now we have seen that the global hypothesis of positive 
curvature (bounded away from zero) throughout the surface implies compactness of the surface. 
The following exercises point out the limits of the hypotheses and conclusions. 


Exercises 6.8.17. The hyperboloid of two sheets x? + y* — z? = —1 and the paraboloid z = 
x? + y? have nonnegative Gauss curvature and are complete since they are closed in R?. 
They are not bounded however, so they are not compact. What goes wrong here with Bonnet’s 
Theorem 6.8.16? Look at the formulas for the Gauss curvatures of these surfaces. 


Exercise 6.8.18. 1s the converse of Bonnet’s theorem true? That is, if M is compact (say, closed 
and bounded in R?) with Diam(M) < z/a, then is it true that K > a* > 0? Hint: estimate the 
diameter of a torus and compare its Gauss curvature. 


Exercise 6.8.19. In Example 6.8.12 and Exercise 6.8.13 above, the Jacobi equation f” + K f = 
0 was solved for K = 1, K =0 and K = —1. Use these solutions to interpret how geodesics 
starting from the same point come together or grow apart. To do this, note that a solution (by 
uniqueness) must be /G for a particular geodesic polar patch and that /G = |x,| measures the 
instantaneous rate of change of a radial geodesic in the v-direction. In Chapter 5 we considered 
geodesics as paths of particles constrained to lie on surfaces under no external forces. The 
interpretation above shows that, in particular, equilibrium positions of such particles on surfaces 
of negative curvature exhibit exponential instability (see [Am78, Appendix | IJK]). 


6.9 Maple and Holonomy 


Parallelism, together with its offspring holonomy, is a difficult concept to grasp. From a 3- 
dimensional perspective, the vectors that are deemed “parallel” along a curve rarely look parallel. 
So how can we understand parallelism and holonomy in a reasonably simple way? Our goal will 
be to use Theorem 6.3.5 to transform the holonomy of an easily understood situation to that of 
a more complicated (and less intuitive) one. In this way, we will see that holonomy is simply 
a natural reflection of the way piecing together simple local geometries into a global structure 
leads to complexity. 

First, let’s see how Maple can be used to display holonomy on the sphere. The idea is this. 
Fix a tangent vector on the sphere (“holovec” below) and a frame E,, E> at that point. Create 
frames evenly spaced along the latitude circle by defining a loop and taking ek := subs({u = 
2* Pixi/n,v = v0}, Ek). Then rotate the frames by multiplying the frame vectors by the 
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holonomy —2z sin(vo) (with a factor 3/4 to make the vectors smaller). We use the “arrow” 
command to create a tangent vector starting at the latitude circle and going in the direction 
of the rotated vector. Finally, we plot the set of vectors and the sphere simultaneously using 
“display”. The inputs for the procedure are the latitude vp, the number of vectors n and orientation 
angles. 


> with(plots) :with(LinearAlgebra) : 

> holosph:=proc(v0,n,ori1,ori2) 

local sph,E1,E2,e1,e2,lat,holovec,i,vec,vecc,sphere,VV; 
sph:=<cos(u)*cos(v) |sin(u) *cos(v) |[sin(v)>; 
E1:=map(diff,sph,u)/VectorNorm(map(diff,sph,u) ,Euclidean, 
conjugate=false) ; 

E2:=map(diff,sph,v) ; 

lat :=subs(v=v0, sph) ; 

holovec:=<0/1[0>; 

for i from 0 to n do 

e1:=subs ({u=2*Pi*i/n, v=v0},E1); 

e2:=subs ({u=2*Pi*i/n,v=v0} ,E2); 

vec [i] :=arrow(subs (u=2*Pi*i/n,lat) ,ScalarMultiply(e1, 
3*cos (-2*Pi*i/n*sin(v0))/4)+ScalarMultiply(e2, 
3*sin(-2*Pi*i/n*sin(v0))/4) ,color=blue,width=[0.04], 
head_length=[0.2]); 

od; 

vecc:=convert(vec,set): 

sphere :=plot3d(sph, u=0..2*Pi,v=-Pi/2..Pi/2,shading=xY, 
grid=[25,25]): 

WV:= vecc union {sphere}: 

display (VV,scaling=constrained, orientation=[ori1,ori2]); 
end: 


The following two commands produce the pictures of Figure 6.3. Note that the only difference is 
the viewing orientation. 


> holosph(Pi/6,15,0,-21); 
> holosph(Pi/6,15,82,28); 


Now, by Exercise 6.3.12 and Exercise 6.3.13, for a specified latitude circle on the sphere, a 
cone can be determined that has the same holonomy around a parallel circle as the sphere does 
around the latitude. Let’s use Maple to see how such a cone is found. First, we define some 
notation. 


> sinphi:=1/sqrt(1+a~2) ; 
cosphi:=simplify(sqrt(1-sinphi~2) ,symbolic); 


sinphi = rary 
cosphi := . 
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Now we write the parametrization of the cone as in Exercise 6.3.12. 


> cone_a:=<u*sinphi*cos(v/sinphi) |u*sinphi+sin(v/sinphi) | 
u*cosphi>; 


ucos(v¥1+a?) usin(vV1 +a?) ua 
VIi¢a ° Vi+a ° Vi+a@ 


cone = — 


Our standard “EFG” metric procedure verifies that the metric is independent of our choice of a. 
We also use the “UN” procedure to find a unit normal. 


> EFG(cone_a); 
[1, 0, v7] 
> U:=UN(cone_a); 


ee _acos(v v1 + a’) _asin(v V1 +a’) 1 
Ji¢a Ji¢@ ° Vita 


If we take the line through a point on the edge parallel circle of the cone in the direction of the 
unit normal, then this line should pass through the desired center of the sphere whose latitude 
corresponds to the circle. The line is parametrized by cone,g(uo, vo) + t U(uo, vo). The center of 
the sphere lies on the z-axis, so we can find the appropriate value 7 that ensures this. We use 
Maple’s “solve” command to find where the x-coordinate is zero. 


> T:=solve(cone_a[i]+t*U[1]=0,t); 


u 
CSS 
a 


Now we can substitute the value T into the third coordinate of the line parametrization to find 
the center of the desired sphere. 


> z_coord:=simplify(cone_a[3]+T+U[3]); 


uJV1l+a? 


a 


z.coord := 


Of course, this is the same as u/ cos(@). The radius of the sphere can be determined by calculating 
the Euclidean distance from the point on the cone to the center just found. 


> rad:=simplify(VectorNorm(cone_a-<0]0|u*sqrt(ita*2)/a>, 
Euclidean, conjugate=false) ,symbolic) ; 


u 
rad := — 
a 
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Of course, this is simply u sin(p)/ cos(¢). The following commands plot the cone and sphere 
seen in Figure 6.4. 


> coneplot:=plot3d(eval(cone_a,a=1) ,u=0..2, 
v=0..2*Pixeval (sinphi,a=1) ,grid=[5,25] ,shading=zhue) : 

> sphereplot:=plot3d(<eval (rad, {a=1,u=2})*cos(u) *cos(v) | 
eval (rad, {a=1,u=2})*sin(u) *cos(v) |eval (rad, {a=1,u=2})*sin(v)+ 
eval (z_coord, {a=1,u=2})>,u=0..2*Pi,v=-Pi/2..Pi/2,style=patch, 
shading=XY) : 

> display({coneplot,sphereplot},scaling=constrained, 
orientation=[0,90] , lightmodel=light3) ; 


Now we come to the culmination of our attempt to understand holonomy using Maple. We have 
seen that the holonomy around a latitude circle on the sphere is equivalent to that for a parallel 
circle on an appropriate cone. So if we understand the origin of holonomy on a cone, then, 
in principle, we understand the sphere as well. Now, we saw in Theorem 6.3.5 that isometries 
preserve holonomy. We also saw in Exercise 5.5.5 that a cone may be unrolled isometrically 
to a pie wedge in the plane. Therefore, parallel vectors in the plane must be carried to parallel 
tangent vectors on the cone under the unrolling process. Maple is perfectly suited to displaying 
the evolution of these tangent vectors. 


> with(plots) :with(LinearAlgebra) : 


We first verify that the unrolling map gives isometries at each stage t. Namely, we test that the 
metric coefficients do not change. 


> EFG(<u/sqrt (1+t*a*2) *cos(v*sqrt (1+t*a*2) ) | 
u/sqrt (1+t*a*2) *sin(v*sqrt (1+t*a%2) ) | 
atu/sqrt (1+t*a*2)*sqrt(t)>) ; 


[1, 0, u7] 


The following procedure unrolls the cone isometrically along with tangent vectors along a 
boundary curve. The inputs are: ft, the stage of the unrolling; a, the cone parameter; n, the number 
of tangent vectors; vl, v2, the coordinates of the original vector; and orientation angles. The 
idea is this. Input a tangent vector in the plane and recognize that it can be written as a linear 
combination of x, and x, (along the boundary curve) since they form a basis for the tangent 
plane. Thus, we have 


holovec = Ax, + Bx, 


for coefficients A and B determined by 


x, - holovec X, - holovec 
A = ———_, B= ———_.. 
Xu ° Xy Xy° Xy 
In fact, we can take a series of vectors parallel to the original one and find coefficients with 
respect to the original (planar) x,,, x, for each. But we know that isometries preserve lengths and 
angles, so since the original x,, x, (denoted xu0, xv0) go into parameter tangent vectors (denoted 
xut, xvt) at every stage, the coefficients A and B remain unchanged! Hence, we can use them 
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together with the basis vectors xut, xut to create the evolved tangent vectors. Since isometries 
preserve holonomy, the evolved vectors behave exactly as the original planar vectors do. The 
procedure plots the evolved cone and evolved tangent vectors. 


> holounroll:=proc(t,a,n,vl,v2,ori1,ori2) 

local ys,yt,boundary_circle,i,tani,vec,y0,xu0,xv0,xu0i,xv0i, 
xut ,xvt,A,B,holovec,newholovec,cone,vecc,VV; 

ys:=s-><u/sqrt (1+s*a*2) *cos (v*sqrt (1+s*a*2)) |u/sqrt (1+s*a*2) 
*sin(v*sqrt (1+s*a°2)) |a*u/sqrt (1+s*a72)*sqrt(s)>: 


yt:=ys (t); 
boundary_circle:=subs({u=3},yt): 
y0:=ys (0): 


xu0:=subs(u=3,map(diff,y0,u)); 

xv0:=subs (u=3,map(diff,y0,v)); 

xut :=subs (u=3 ,map(diff,yt,u)); 
xvt:=subs(u=3,map(diff,yt,v)); 

holovec:=<v1|v2|0>; 

for i from 0 to n do 

xu0i:=subs (v=2*Pi/sqrt (1+a72)*i/n,xu0) ; 

xv0i:=subs (v=2*Pi/sqrt (1+a*2)*i/n,xv0) ; 

A:=DotProduct (xu0i, holovec, conjugate=false)/VectorNorm(xu0i, 
Euclidean, conjugate=false)~2; 

B:=DotProduct (xv0i, holovec, conjugate=false) /VectorNorm(xv0i, 
Euclidean, conjugate=false) “2; 
newholovec:=ScalarMultiply (subs (v=2*Pi/sqrt (1+a*2)*i/n,xut), 
A)+ScalarMultiply (subs (v=2*Pi/sqrt (1+a72)*i/n,xvt) ,B); 

vec [i] :=arrow(subs (v=2*Pi/sqrt (1+a*2)*i/n, boundary_circle), 
newholovec, color=blue ,width=[0.08], head_length=[0.2]); 

od: 

vecc:=convert (vec,set): 
cone:=plot3d(yt,u=0..3,v=0..2*Pi/sqrt (1ta"2) ,shading=XY, 
grid=[5,25]): 

VV:=vecc union {cone}: 

display (VV, scaling=constrained, orientation=[ori1,ori2]); 
end: 


The following sequence of commands produces Figure 6.12. The evolution proceeds from left to 
right and then down. 


> holounrol1(0,sqrt(3),15,1,1,-89,0); 

> holounroll(.2,sqrt(3),15,1,1,-118,70); 
> holounroll(.4,sqrt(3),15,1,1,-80,73); 
> holounroll(.6,sqrt(3),15,1,1,-56,76) ; 
> holounroll(.8,sqrt(3),15,1,1,-56,78); 
> holounroll(i1,sqrt(3),15,1,1,-4,62); 


The following command animates the evolution of the parallel vectors. 


> display (seq(holounroll(i/20,sqrt(3) ,15,1,1,-15,50), 
i=0..20),insequence=true) ; 
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Figure 6.12. Evolution of parallel vectors 


In Figure 6.12, we see the tangent vector rotate by x. What is the reason for this? Recall from 
Exercise 6.3.12, that the holonomy around the boundary curve of the cone is 27 sin(@), where 
sin(p) = 1/1 +.a?. The commands above have a = V3, so we obtain sin(@) = 1/2. That is, 
@ = 1/6 and holonomy is 7, Note that this means that the cone above corresponds to the —7/6 
latitude circle on a sphere. 


Exercise 6.9.1. Consider the pie wedge of Figure 6.12. Show that the holonomy there is also 
27 sin(@). Hint: compare the angle of the tangent vector with the u-parameter curve (i.e., a 
meridian). 


The Calculus of Variations and 
Geometry 


7.1 The Euler-Lagrange Equations 


In previous chapters, we have seen that various geometric entities have a tendency to minimize 
some quantity. For instance, geodesics try to be paths of shortest arclength while minimal surfaces 
(including their physical representatives, soap films) try to be surfaces of least area. Of course we 
have also seen that there are geodesics which do not minimize arclength and minimal surfaces 
which are not least area. In this chapter we will try to put these results into perspective by 
giving a quick tour of the calculus of variations — the subject whose principles embody this 
geometric (and physical) tendency toward minimization. Now, whole books have been written 
about this subject (see [Sag92, Wei74, Pin93] for example) and the subject itself is rife with 
essential technicalities, so we shall stick with what true variationalists call the naive theory. 
Nevertheless, we shall see that variational principles and methods are intimately connected to 
geometry. 

Here’s the general set-up for the calculus of variations: let x = x(t) denote a function of t 
with fixed endpoints x(to) = xo and x(t,) = x;. See Figure 7.1. Because of this picture, we often 
refer to x(t) as a curve joining the endpoints. Indeed, the function x may, in fact, be a curve 
in n-space, so it pays to keep in mind this possibility. In particular, we might like to think of a 
curve x(t) = (x'(r),..., x"(t)) as being the path of a particle with respect to time f. This is the 


X(tq) x(t) 


Figure 7.1. Curve with fixed endpoints 
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mechanics point of view. One potential source of confusion must be mentioned right away. While 
we have used the standard mechanics notation x = x(t) = (x!(t), ..., x"(f)) above, when we deal 
with the geometry of the plane, we will use the coordinates which most of us feel comfortable 
with. Namely, we will write the independent variable as x and the function in question y = y(x). 
The context of the problem should make everything clear. 

Now, there are many choices of curves x(t) joining the given endpoints. It is only when we 
add some sort of condition to be satisfied that we can pick special x(t) out of this collection. For 
example, we have the 


Definition 7.1.1 (Fixed Endpoint Problem). Find the curve x = x(t) with x(t) = xo and x(t;) = 
x, such that the following integral is minimized 


J= if ; f(t, x(t), X(t) dt 


where f(t,x, x) is a function of t, x and * =dx/dt and the latter two are thought of as 
independent variables. 


Example 7.1.2 (Hamilton’s Principle). 

Let T = 1/2 mx? denote the kinetic energy of a particle moving along the x-axis. A typical 
potential energy function depends on the distance of the particle from a specified point x9, so we 
can write V = V(|x — xo). As we shall see later, Hamilton’s principle says that the motion of 
the particle x = x(t) will be such that the integral 


y= fr—vai= f jami?— vex xo) at 


is minimized. (In fact, as we shall see below, this is not quite correct. Hamilton’s principle only 
requires that the integral be extremized.) Because this example is so important, it is traditional to 
keep the dot notation x for derivatives in even more general situations. 


Example 7.1.3 (Shortest-Distance Curves in the Plane). 
We have seen before that the shortest distance between two points in the plane is attained by a 
straight line. If we use x y-coordinates, then the problem of determining the curve y = y(x) of 


minimum arclength is simply to 
Minimize fv 1+ y" dx. 


In this case, f(x, y, y’) = V1 + y”. Also, note that here we have used the usual prime notation 
for derivatives. 


To approach the fixed endpoint problem, we should first recognize that, just as in ordinary 
calculus, our methods are best suited to finding /ocal minima, not global minima. In order to 
recognize a minimum x(t), just as we do in ordinary calculus, we take the derivative of J with 
respect to € and note that, since x(f) is a minimum by hypothesis, this derivative is zero for 
€ = 0. A crucial point is that we can take the derivative inside the integral due to the following 
result. 
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Lemma 7.1.4. [f f(€, y) and af(e, y)/d€ are continuous, then 


d b b a : 
=| fe dy = f ae» Y ay. 


Proof. Let h(z) = fe f(z, y)dy. We want to show that the €-derivative of the integral in question 
is h(e). Using the fact that we can switch the order of integration in a double integral (i.e., by 
Fubini’s theorem), we get 


E h(e)dz = y ip fale, y)dy dz 
-[f fz, ydz dy 


-{ fle ydy~ f fc, y)dy. 


The Fundamental Theorem of Caiculus then gives 


h(e) = 5 ff meac 
d f? d f° 
= =| fle dy- = f fle, y)dy 


d b 
= =f fle. y)dy 


since the second term does not depend on ¢. This is of course exactly what we wanted. O 


With this in mind, let’s start to analyze the problem. Suppose x(t) is a curve which minimizes 
the integral J = vis F(t, x(t), x(t)) dt and let x*(t) = x(t) + € n(t) be a variation of x. That is, 
we think of € as being small and we require that (to) = 0 and n(t;) = 0. Therefore, the curve 
x*(t) still joins x9 and x, as well as being “close” to x. Note too that x* = x + € 7. With this 
notation, we can think of the integral J as a function of the parameter € 


fh th 
xe)= | fi.xt.a)d = f f(t,.x+en,e+en)adt. 
fo to 


In order to recognize the minimum x(t), we take the derivative of J with respect to €. 


dJ al af eae af ax* 
de J, ax* de ax* de 


by the chain rule, so 


dJ " 9@ a 
=} f Fat 
fo 


de Saxe" + age 
with 


dJ 
0=— 


"@ a 
=F oF ah OL ath 
e=0 to x 
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since, ate = 0, x*(t) = x(t). Now, the second term inside the integral may be integrated by parts 
as follows. Let 


eek 


=—, dv=nadt 
ax ewe 


and compute 


ty a 
/ af dt= 
fo ax 


eee ie 
ro to "at 5 (gr) # 


t9 


-- fra (3) @ 


since n(t9) = 0 = n(t;) implies the vanishing of the first term. Putting this in the equation above, 


we have 
n9@ " d fa 
om fa [va Ge) # 
ty OX to "at ax 


" Tar 4d af 


This equation must hold for every function n with n(fo) = 0 = n(t,). How can this be? The 
following exercise provides an answer. 


Exercise 7.1.5. Suppose that a continuous function y = f(x) Is positive at a point x9. Show that 
it is possible to choose a function n(x) so that on some interval [a, b], 


b 
| n(x) f(x)dx > 0. 


a 


Hints: (1) draw a graph of f(x) about x9 and say something about the values of f(x) for all points 
xX near Xg; (2) create a “bump” function n(x) which guarantees the positivity of the integral above 
by drawing its graph on the same axes as the graph of f(x) near xg; (3) remember that positive 
integrands produce positive integrals. 


The exercise tells us that there is only one way equation (*) can hold for all choices of 7. Namely, 
we must have 

af  d (af 

ax dt \ax 


This equation is called the Euler-Lagrange equation and it gives us a necessary condition for x(t) 
to be a minimum. 


Theorem 7.1.6. /f x = x(t) is a minimum for the fixed endpoint problem, then x satisfies the 
Euler-Lagrange equation. 
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Notice that we are not saying that a solution to the Euler-Lagrange equation is a solution to the 
fixed endpoint problem. The Euler-Lagrange equation is simply a first step toward solving the 
fixed endpoint problem, akin to finding critical points in calculus. Nevertheless, because we deal 
with an unimaginably huge collection of possible solution curves for the fixed endpoint problem, 
the Euler-Lagrange equation is a powerful tool which is indispensable. Indeed, it is sometimes 
the case that only solutions to the Euler-Lagrange equations may be found with little or no other 
information to guide us to a solution of the fixed endpoint problem. For this reason solutions 
to the Euler-Lagrange equation are given the special name extremals and the fixed endpoint 
problem, for example, is often rephrased to say that a curve x(r) is desired which joins the given 
endpoints and extremizes the integral J. In this case, solutions to the Euler-Lagrange equation 
solve the problem. 


Exercise 7.1.7. Suppose that J depends on two functions x(t) and y(t), 


J= / "f(t. x(0), y(t), £0), WO) dt. 


Show that variations x* = x + €n and y* = y + eT lead to 


dJ 
0=— 
de |, 


perce af ay 
ae dx de dide dyde JY dE 


-f%, af, , oF, af 


dt 


pe peat 
set ae ay aye 


" 9 d [a " Ta d (a 
“fol alle t-Ls-a(s)]# 
to ax dt \ax a dy dt \dy 

and that this equation can hold for all 7, t if and only if the following Euler-Lagrange equations 
hold: 

af  d (of Li af  d (af 

ax dt \ ax , dy dt \ay 
Exercise 7.1.8. Suppose that J depends on two independent variables t and s; 


J = ff F.s.x0.s).(t.9),t s)paras 


Show that a variation x*(t, 5) = x(t, s) + €n(t, s) with y|¢ = 0, where C is the boundary of the 


region of integration R, leads to 
a a a 
if eee iia Lodi 
Ox Ox; Ox, 


Further, recalling Green’s theorem f —Pdt+ Qds = f{ [ 9Q/dt+4P/ds dt ds, letting Q = 
n(af/dx,), P=n(df/dx;) and using nlc =0, show that the last two terms of the 
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of af 
— ; dtds = dtd 
[fF a Pie ce -f fr | = =| . 


and substitution then gives 


af af 
O= panel oe dt ds. 
| ik E ax, 0t a ‘ 


Finally, argue that this implies that the Euler-Lagrange equation for two independent variables 


is 
of a fof of 
dx aor (34) - as ~ (52) = 
Exercises 7.1.9. Suppose f(t, x, x) = f(x, x) does not depend on t explicitly. By this, we mean 
that f only depends on ¢ through the curves x and x, so that 9f/dt = 0. Show that a non-constant 
x(t) satisfies the Euler-Lagrange equation if and only if 
af 


er heals 


where c is a constant. This equation is said to give a first integral for the problem. Hint: compute 
the derivative with respect to t of the left-hand side. Don’t forget the chain rule. 


integral give 


Exercise 7.1.10. Suppose f(t, x, y, x, y) = f(x, y, x, y) does not depend on ¢ explicitly. By 
this, we mean that f only depends on ¢ through the curves x, y, x and y, so that df/at = 0. Show 
that non-constant x(t), y(t) satisfy the Euler-Lagrange equations if and only if 


where c is a constant. Hint: compute the derivative with respect to t of the left-hand side. Don’t 
forget the chain rule. 


Exercise 7.1.11. Show that, if af/dx = 0, then x(t) satisfies the Euler-Lagrange equation if and 
only if df/dx = c, a constant. 


Exercise 7.1.12. Show that the integrals 


[ foxinat ana [50 Paes ech a aay 


have the same Euler-Lagrange equation. In particular, note that when g(t, x) = ct, withc constant, 
then f f dt and f f +c dt have the same Euler-Lagrange equation. We will use this fact when 
we discuss Jacobi’s theorem, Theorem 7.7.6. 


Exercise# 7.1.13. Find the extremal x(t) for the fixed endpoint problem 
4 
] 
J = x sint+~x?dt 
0 2 


with x(0) = 0, x(7) =z. 
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Exercise 7.1.14. Find the extremal x(t) for the fixed endpoint problem 
1 
J= i Exe +x? xdt 
it) 
with x(0) = 1, x(1) = 4. 


Exercises 7.1.15. Find the extremal x(t) for the fixed endpoint problem 


{2 
J = x? dt 
0 
with x(0) = 0, x(7/2) = 1. 


Exercise 7.1.16. Here is a type of problem where it is possible to identify an extremal as a 
minimizer (as well as calculate the value of J in a simple way). Consider the fixed endpoint 
problem with x(to) = xo, x(t;) = x; and integral to be minimized 


th 
J = pxvrtq xd 


to 


where p = p(t) and g = q(t) are arbitrary smooth functions. 


(1) Show that if x = x(t) is an extremal for J, then J = ea = p> x X(t) — p? x X(to). 
Thus the value of J is easy to calculate for an extremal. 


(2) Show that ifx = x(t)is an extremal for J and n = n(t) is a function with n(to) = 0 = n(t,), 
then 


~ h 

J =] p’xn+q°xndt =0. 
to 

(3) From (1) and (2) show that an extremal x(t) for J is a solution to the fixed endpoint 
problem. That is, x(t) minimizes J. 


Hints: For (1), differentiate x p? x with respect to ¢ using the product rule on (x)(p? x) as indicated 
and remember that x is an extremal. For (2), compute f p? x1 dt by parts, remembering that 
x is an extremal, or review the derivation of the Euler-Lagrange equation. For (3), vary x by 
x +n (where the usual € is absorbed into 7) and expand the integrand of 7 below, noting that 
J = J +. What can you say about Y? 


a on 
F=f Parise etna 
fo 
Exercise 7.1.17. In the fixed endpoint problem, suppose the time f; is fixed (as well as to and x(to) 
of course), but the final position x(t;) is undetermined. From the derivation of the Euler-Lagrange 
equation, show that not only the Euler-Lagrange equation must be satisfied, but that the extra 
condition 
af 


ag = 
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must be satisfied as well. Hint: everything in the derivation works as before except that, after 
integrating by parts, the first term no longer vanishes. Argue that the Euler-Lagrange equation is 
satisfied because the eventual extremal may be considered as an extremal for the fixed endpoint 
problem with x(t) being whatever point the extremal hits at t = fr). 


Remark 7.1.18. The preceding exercise hints that not all variational problems have fixed end- 
points. In fact, there are many problems where either the final “time” rt, or the final “state” x(t)) 
are undetermined. As the exercise suggests, in these cases the Euler-Lagrange equation holds 
together with an extra condition which is sometimes called a transversality condition. This is the 
subject of the next section. 


7.2 Transversality and Natural Boundary Conditions 


Many problems involving the calculus of variations don’t fall under the exact guidelines of the 
Fixed Endpoint Problem. One simple generalization is the following. 


Definition 7.2.1 (Endpoint-Curve Problem). Find the curve x = x(t) with x(to) = xo and x(t) 
lying on a curve @ such that the following integral is minimized. 


fh 
= | f(t, x(t), x(t)) dt 
lo 
Here, it may be the case that rt; is not specified. 


In fact, the situation of the last sentence of the definition arises so often that we give it its own 
definition and name. 


Definition 7.2.2 (Undetermined Time Problem). Find the curve x = x(t) with x(to) = xo and 
x(t)) = x; for some t; such that the following integral is minimized. 


fh 
J =f F(t, x(t), x(t)) at 
to 
Here f, is not specified. 


We will consider the Endpoint-Curve Problem and then take a special case to answer the 
problem posed in Definition 7.2.2. As usual, let x(t) denote a solution to the problem and 
suppose x*(t) = x(t) + €n(t) is a variation of x(t). Now, however, we only restrict 7 by requiring 
n(to) = 0. Therefore, x*(to) = xo also. Of course, a hidden requirement is that x*(t\') lie on the 
curve a, where t/ = t; + €&(t;), since f, (the final time for the solution x(t)) may change. So, to 
carry out the differentiation of 


ty tek 
J= | F(t, x, x) dt, 


19 


we must know how to differentiate with respect to a limit of integration as well. This involves a 
refinement of Lemma 7.1.4. 
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Lemma 7.2.3. If f(e, y) and af(e, y)/d€ are continuous and g(e) is differentiable, then 
d ale) 


ale) g ; 
af fend =soresens [ ay. 
mae a dE 


Proof. Let h(e, w) = fe Ff (€, y)dy be a function of € and w. The partials of h are given by 


| FED... MEaFew 


by Lemma 7.1.4 and the Fundamental Theorem of Calculus respectively. The chain rule says that 
the total derivative of # with respect to € is given by 
dh dw 


a =h, hy 
de "= de 


and, since w = g(€), dw/de = g’(e). Substituting w = g(€) then gives the desired result. O 


Now, once we know how to differentiate the integral when the upper limit depends on €, we 
modify the earlier derivation of the Euler-Lagrange equation. 


dJ T of ax* af a: 
-[ SS+H5 
ty 


dty 
fe 1 oes bee det 


T 9 of. 
-/ big ds f dt + Eft) 
fy Ox* ox* 


with 


4) 
Sse 


ng fe] 
=a wont Hat + 6/th) 
€=0 fo 


since, ate = 0, x*(t) = x(t). Now, the second term inside the integral may be integrated by parts 
as usual to produce 
"@ an) ‘ of 
fo ROE to ns t 


0 " d fa 
= site| n (e ai 


since (to) = 0. Putting this in the integral above, we have 


ng h 9 
a), a A "at (Ge =) dt + n(n oon) + EF) 


n 0 d {da 
= nl - =( =) dt + n(n)o(h) + 8 (0) 


ox dt 


If x(t) is a solution to the broader problem of extremizing J subject to ending on the curve a, 
then x(t) is also a solution to the Fixed Endpoint Problem with fixed endpoint x(t,). Therefore, 
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x(t) satisfies the Euler-Lagrange equation already. The first term in the equation above is then 
zero and we have 


fe] 
mony a(t) + 8/0) = 0. 


Now let’s try to understand the relationship between n and & by using the requirement that x*(r}) 
lies on the curve @ for any tf = t + €&. 

Write the curve a implicitly as a level set g(t, x) = c. Then, differentiation with respect to € 
on both sides gives 


_ dg(tt, x*) 

~ dé 

= BE + ge(xE + + €N€)) 
= g& + g.(x& + n(t)) 


0 


where the last line comes from evaluation at « = 0. We can solve for n(t,) and plug into the 
equation above to get 


ntti) eat +Ef(t)) =0 

xX 

Garae -- a) ein) te pny =0 
8x Ox 


g [a - (228) | = 0. 
g Ox 


x 


Because & may be nonzero, the term in brackets vanishes. The simplified condition then gives 


Theorem 7.2.4 (Transversality Theorem). An extremal for the Endpoint-Curve Problem must 
satisfy the following transversality condition at the ending time t,: 


re) 
P(tyax(t) — [er(tr) + galt ott) = 0. 


Remark 7.2.5. The term transversality comes from interpreting the condition above as a condi- 
tion on the angle of intersection of the curve g(t, x) = c and the extremal curve x(t). To see what 
this geometric condition is, recall that the gradient Vg = (g;, g,) is perpendicular to the level 
curve g(t, x) =c. Also, the extremal curve may be parametrized by (t, x(t)) with tangent vector 
(1, x). Then the dot product gives 


8 + 8rk = Vg - (1,4) = |Valv 1 + x? cos(@) 


where @ is the angle between the gradient Vg and the tangent vector (1, x). Plugging in for 
&: + gx and solving the transversality condition then gives 


Sf (tiex(t) 
Z(nyVa(nlvl + x(n? 


Knowing the angle 6 then allows us to determine the angle of intersection of the curves (since 
we know Vg is perpendicular to the curve g = 0). 


cos(@) = 


7.2. Transversality and Natural Boundary Conditions 321 


Example 7.2.6 (Transversality at Vertical Line). 
Suppose g(t, x) = ¢ and c = t,, so that the curve is a vertical line in the (t, x)-plane. Then g, = 1 
and g, = 0, so the transversality condition becomes 

of 


ag) =0. 


This is the situation when the final time is fixed at t,, but the final position x(t,) is unspecified. 


The transversality condition in this case is sometimes called a natural boundary condition. 


Example 7.2.7 (Transversality at Horizontal Line). 
Suppose g(f, x) = x and c = x, so that the curve is a horizontal line in the (¢, x)-plane. Then 
g; = Oand g, = 1, so the transversality condition becomes 


.,, of 
f(t) — ¥C) a) = 0. 


This is the situation when the final position is fixed at x, but the final time is unspecified. Hence, 
this transversality condition is the extra condition necessary for solving Definition 7.2.2. 


Exercise 7.2.8. Suppose J = i f(t, x(t), y(t), x, y) dt is to be extremized with fixed beginning 
point in 3-space and endpoint required to lie on a surface given implicitly by g(t, x, y) = c. Show 
that the natural boundary conditions for this situation are 


a 
f(tgx(t) — Lgr(ti) + ex (ti) x(t1) + axle ge(A) = 0 


a 
Fta y(t) — [erlt) + 8x(tix(t) + By (x(t nsctn) = 0. 


To show these relations, carry out the proof of Theorem 7.2.4 on each function (i.e., x and y) 
one at a time. That is, first take variations x* and tf and leave y as in the hypothesized extremal. 
Derive the natural boundary condition. Then do the same for y* and tr? leaving x alone. Note 
that when the surface has the form of a graph of a function g(t, x, y) =t — h(x, y), then the 
conditions become 


a 
f(tyhx (th) + (1 — Ax) x(t) — hse ved (0) =0 
a 
f(tDAy(t) + U1 — Ax (4) x(t) — hy(enstedlgn) = 0. 


Exercise 7.2.9. Use the natural boundary conditions of Exercise 7.2.8 to show that an extremal for 
distance from a point to a surface g(x, y, z) = c in 3-space must meet the surface perpendicularly. 
Hint: show that the tangent vector to the extremal is parallel to the normal to the surface given 
by the gradient of g. 


Exercise 7.2.10. Extremize the following integral with x(0) = 0 and x(1) free. 


1 
/ x? 44% 42xe! dt 
0 
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Exercise 7.2.11. Extremize the following integral with x(1) = 0 and x(2) free. 
2 
/ z+Px dt 
1 
Exercise 7.2.12. Extremize the following integral with x(0) and x(1) both free. 
1 
/ Rt xttx+idt 
0 


Exercise 7.2.13. Extremize the following integral where x(0) = | and x(T) lies on the curve 
x(t) =24(t— 1). 


7.3 The Basic Examples 


No discussion of the calculus of variations would be complete without mentioning the problem 
which launched the subject, the brachistochrone. The name is taken from the Greek words 
brachist, which means shortest, and chronos, which means time. The problem itself is this: 


Example 7.3.1 (The Brachistochrone Problem). 

Given a point (a, b) in the x y-plane, find the curve y(x) joining (a, b) and the origin so that a 
bead, under the influence of gravity, sliding along a frictionless wire in the shape y(x) from (a, b) 
to (0, 0), minimizes the time of travel. See Figure 7.2. 

The key to setting up this problem is the simple formula D = RT, distance equals rate 
times time. The “infinitesimal” distance travelled by the bead is just the arclength of the wire, 
D=,/1+y’. The rate of descent may be determined by noting that potential energy is given 
by the formula mgh where m is the mass of the bead, g = 9.8 m/sec? is the acceleration due to 
gravity (near the surface of the Earth) and h is the height of the bead above a fixed reference 


Figure 7.2. Bead on a wire 
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height (which is often taken to be zero on the Earth’s surface). If we start the bead at (a, b) with 
initial velocity zero, then conservation of energy requires that the kinetic energy of the bead be 
equal to the loss of potential energy due to decreasing height. In other words, 


1 
mge(b—y)= 5mu 


where v is the speed of the bead. We then have that v = /2g(b — y). So, D = RT tells us that 
time T is a function of y and y’, 


(62 
VI) = Se: 
V2g Jb—-y 
The total time is then found by integrating T(y, y’) with respect to x from a to 0. The problem of 
the brachistochrone now becomes the fixed endpoint problem 


Minimize T — — poe. 
Z SS —— oo AX 
V28 Ja Vb-y 


with y(a) = b and y(0) = 0. 
To solve this problem, first substitute u = b — y and ignore the constant factor 1/,/2g to get 
a new integral to be extremized, 


y= f Oa 
a Vu 


Because the independent variable does not appear explicitly in J, we may use Exercise 7.1.9 to 
get 


vi+u?r , u’ 
u 
Ju Vu(l +u’) 


Now, finding a common denominator, simplifying and replacing 1/c by a new constant c gives 
a separable differential equation c = u(] + u’”) which leads to 


+= | ve du 


C—-u 
=-2 f Je =wraw 


where w = ./c — u and c is replaced by c?. So, with w = csin@, 


x= -2 f ceos0 cose a8 


= —2c¢? ‘ cos’ 6 d@ 


20 
258 i, ae 8 


e 
= — 7 20 +sin26)+r 


x(b) = k(@+ sing) +r 
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where 20 = ¢. Now, b— y =u =c? cos*6 = c* cos* £ by the substitutions above, so again 
using cos? £ = (1 + cos ¢)/2, we obtain 


y(@) = k(1+cos¢?)+b 


where k = ~c*/2 as for x(#). Note that y(2) = b, so that it must also be true that x(7) =a. 
This implies that r = a — km. If we let € = ¢ — 7, then we obtain the parametric formulas for x 
and y, 


x(€)=k(€ —singé)+a and y(¢)=k(1 —cosf)+b. 


This is the parametrization of a cycloid. Hence, the solution to the problem of the brachistochrone 
is one of our standard curves from Chapter 1. Also, note that, by Exercise 1.1.14, the brachis- 
tochrone is the tautochrone as well. For a Maple approach to the brachistochrone, see [Opr00]. 
Finally, for the brachistochrone with friction, see [HK95] and for a very different approach, see 
{[Law96]. 


Example 7.3.2 (The Brachistochrone to a Line). 
What if we alter the brachistochrone problem Example 7.3.1 to ask for the curve from a point to 
a vertical line such that a sliding bead takes the least time? The integrand f in the integral to be 
minimized is 

1+y? 

v=y 
where we now use x, y and dy as variables. The transversality condition from Example 7.2.6 is 
then 


af x 
0= (0) = 
dy Vl+y?J/b—y 


which says that y’ = 0 in the (x, y)-plane. That is, the brachistochrone to a vertical line must 
come into the line horizontally (i.e., at a right angle). 


Exercise# 7.3.3. Johann Bernoulli solved the brachistochrone problem ingeniously by employing 
Fermat’s principle that light travels to minimize time together with Snell’s law of refraction (see 
[Wei74] and [Opr00]). Here’s a problem in the same vein. Suppose that a light photon moves in 
the upper half-plane to minimize time. Suppose also that the plane is made of a medium whose 
properties imply that the speed of the photon is always proportional to its height above the x-axis. 
That is, v = k y for k > 0. What path does the photon take? Hint: set up the time integral and 
extremize it. 


Example 7.3.4 (Shortest Distance Curves in the Plane). 
Given points (a, b) and (c, d) in the plane, we can ask for the curve x(t) which joins them and 
which minimizes the arclength integral 


J= i Vx7(t)+ y(n) dt. 
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Here we have chosen a two-variable formulation to show how these types of problems work. 
Because the integrand does not depend on either x or y, the Euler-Lagrange equations reduce to 


af x af y 


We i a a gre d = - 
a JPOtrn a9. Jerr 


Solving for the square root in each equation and setting the results equal, we obtain 


yorxt+r. 
c 


This is, of course, the equation of a straight line in the plane. Therefore, extremals of the arclength 
integral are straight lines. We will see shortly that these extremals are, in fact, minimizers for 
arclength. 


Exercise 7.3.5. Solve the shortest distance in the plane problem using one variable. That is, 


arclength is given by f ,/1 + y(x)’ dx. 


Example 7.3.6 (Shortest Distance from a Point to a Curve). 

Suppose we wish to minimize the distance from a fixed point to a curve. What condition does 
Theorem 7.2.4 impose? The distance is given by an arclength integral J = a Vi+y"dx 
(where we think of the extremal curve as parametrized by (x, y(x)). The target curve we again 
write in implicit form as g(x, y) =c. The transversality condition (in xy-coordinate form) in 
Theorem 7.2.4 becomes (at x;) 


,oV1+ y? 
Vl+y?gy — [gs + 8,¥]—-—— =0. 


dy’ 
Carrying out the differentiation and simplifying, we get 
y= &, 
8x 
Now, y’ is the slope of the tangent line to the extremal at the intersection point of the curve and 
the extremal and g,/g, is the slope of the line determined by the gradient (g,, g,). Therefore, the 
extremal meets the target curve in the direction of the gradient which, we know, is perpendicular to 


the tangent line of the curve. Hence, the extremal for distance meets a target curve perpendicularly 
in general. 


Example 7.3.7 (Least Area Surfaces of Revolution). 


We have already seen that a minimal surface of revolution is a catenoid, so, since least area surfaces 
are minimal, this example offers another approach to the question. A surface of revolution has 


surface area given by 
A=2n [ov 1+ y?dx. 


Exercise 7.3.8. Use the fact that the integrand of the surface area integral above is independent of 
x to reduce the Euler-Lagrange equation to y’ = ,/ y* — c?. Then solve this separable differential 
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equation to show y = c cosh(x/c — d), verifying that a catenoid is the only minimal surface of 
revolution. We have already seen that a catenoid is least area for a given boundary only under 
certain circumstances, so this example shows again that extremals are not always minimizers. 


Exercise 7.3.9. Let z = f(x, y) be a function of two variables. Surface area is given by the 


integral 
A= [ fis R+ Raray. 


Use the two-independent-variable Euler-Lagrange equation of Exercise 7.1.8 to show that an 
extremal for this integral is a solution to the minimal surface equation. 


Exercise 7.3.10. Let z = ¢(x, y) be a function of two variables. The Dirichlet integral is given 


by 
A= | [e+e axay. 


Use the two-independent-variable Euler-Lagrange equation of Exercise 7.1.8 to show that an 
extremal for this integral is a harmonic function. Dirichlet’s integral arises in many areas of 
physics and engineering (see Exercise 4.5.5). In particular, the two-variable version above could 
represent the capacity (per unit length) of a cylindrical condenser with annular cross-sections 
and the analogous three-variable version could represent the potential energy associated with an 
electric field. For the latter, a stable equilibrium is attained when potential energy is minimized, 
so a minimizer for the Dirichlet integral is identified with the potential for the field. For more 
information see [Wei74]. 


Example 7.3.11 (Hamilton’s Principle Again). 
Hamilton's principle is a principle of mechanics which states that the equations of motion of a 
physical system may be found by extremizing the so-called action integral 


j= [r-var 


where T and V represent kinetic and potential energy respectively. Of course T and V may take 
many forms depending on the problem at hand. To understand why such a principle might have 
been developed, let’s extremize the action for the particular one-dimensional case of a single 
particle moving along the x-axis under the influence of a conservative force field F. Recall that 
this means that F is the (negative of the) gradient (which here reduces to a single partial derivative) 
of some potential function V(x), F = —grad V = —dV/dx. (The negative sign is traditional in 
physics.) The particle has kinetic energy T = (1/2)mx?, so the action integral becomes 


1 
J= i: 5m — V(x) dt. 


The Euler-Lagrange equation for this integral is then 


aVv(x) ds. 
an oe an) = 0. 


7.4. Higher-Order Problems 327 


Replacing the first term by F and carrying out the differentiation in the second term gives 
Newton’s Law 


F=mi. 


Furthermore, notice that the integrand T — V does not depend on f, so we may use the first 
integral f — x(df/dx) = c of Exercise 7.1.9 to get 


oT -—V 
pope x 
Ox 
1 
—mx* —V —xmx=c 
2 
1 
—zmi'- Vc 
1 24Vae 
= =—C 
5m 
T+V=c 
E=¢ 


where E = T + V is the total energy of the particle. This calculation means that the total energy 
in a conservative force field is actually conserved (i.e., is a constant). The fact that both Newton’s 
Law and the conservation of energy may be derived by Hamilton’s principle indicates why it is 
a cornerstone of classical mechanics!. We will see the connection between Hamilton’s principle 
and geometry in Section 7.6 below. 


7.4 Higher-Order Problems 


7.4.1 A Higher-Order Euler-Lagrange Equation 


Not all variational problems have integrands f(t, x, x) restricted to x and its first derivative x. 
In particular, problems which involve how objects curve or bend often involve second derivative 
terms related to the curvature of the object. We can also find an Euler-Lagrange equation for this 
case. While this may be done for any number of derivatives, we shall confine ourselves to the 
case of an integrand f(t, x, x, <) (or f(x, y, y’, y”) in geometric notation). 

Suppose x(t) is a curve which minimizes the integral 


J= [ F(t, x(t), X(t), X(t) at 


and let x*(t) = x(t) + € n(t) be a variation of x. We require that (to) = 0, n(t,) = 0, (to) = 0 
and 7(t;) = 0. Therefore, the curve x*(t) still joins x9 and x; and points in the same direction as 


'As noted in [Wei74], however, it should not be thought that Hamilton’s principle is somehow more fundamental 
than Newton’s Law. The very definition of mass is a consequence of F = ma and mass is certainly needed to form the 
integrand T — V of the action integral. 
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x(t) at the endpoints (since x* = x + € 4). As usual, we have 


u n 
J(e)= i S(t, x*, x*, x") dt = | f@,xtenxt+en, &+eRH)dt. 
tg fo 
Now, we take the derivative with respect to € to get 


dJ -[" af ax* af ax* af ax* 4 
de J, Ox* dE  Ax* BE —-AK* BE 


= af, oF, oF, 


dt 
axe age t age” 


with 
= dJ 
~ de 


n@ a 
= ieee +a jdt 
e=0 to 


since, at € = 0, x*(t) = x(t). The second term inside the integral may be integrated by parts as 
usual while the following exercise shows how to deal with the third term. 


Exercise 7.4.1. Integrate the third term by parts twice to obtain 
"of. ee) ae of | d@? (af 
dt =) 7 - —(—] dt 
[ gg = | Mog i nis (35 yeh wa (55) 


Now note that the requirements placed on 7 at the endpoints eliminate the first two terms. 
Therefore, when we incorporate this into the usual calculation, we get 


ef OR se OF d? (of 
o= [ool ~ 5 (2) +53 (Ge) 


This equation must hold for every function 7 with n(to) = n(t1) = 0 = nto) = A(t). As before, 
we must have the following Euler-Lagrange equation for second-order problems. 


Theorem 7.4.2. An extremal x(t) for the variational problem 
qu 
Extremize J = i F(t, x, x, ¥)dt 
to 
with x(to) = Xo, x(t}) = x1, X(to) = Xo and x(t)) = pe Satisfies 
of d af ee a) oe 
ax dt dt? 


Exercise 7.4.3. If, in Theorem 7.4.2, the Lagrangian integrand f is independent of x, then show 
that the Euler-Lagrange equation reduces to 


-2+5(Z)- 


Exercise 7.4.4. Show that extremals for 


ist 
| f(t,x,%,%,...,x™) dt 
to 
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subject to (fori = 1,2...,n) x(t) = x;, X(t) = 4,...,x(t)) = x” satisfy the general higher- 
order Euler-Lagrange equation 


af d (af af r af 
£-5(% f\+ F(z f).. +d (25) = 


Before we can treat the next example, let’s recall a few things about the curvature of a plane 
curve. Let a(t) = (x(t), y(t)) be acurve in the plane with tangent vector a(t) = (x(t), y(t)). From 
Exercise 1.3.2, the curvature of a is given by « = |d@/ds|, where 6 is the angle between a’s 
tangent and (1, 0). For a non-arclength parameter tf, we must modify the formula. Recall that the 
relation between the parameter ¢ and the arclength parameter s is given by 


= |a(t)|, 

where we use the dot notation for derivative with respect to t. We denote the speed of a(t) by v. 
We then see that the curvature of a is given by 
_|d0|_ |d6jat [doll 

~|ds| |dt|ds |ar|v 
Now let’s compute a certain quantity involving 0 two ways. Recall @ = (x, y). We have 

a@-(1,0)=x and @&-(1,0) = vcos(d), 

so we obtain (using v = //x2 + y2) 


cos(@) = 


x 
Now, to see how d6@/dr arises, differentiate both sides and use sin(@) = \/1 — cos*(6) = y/v to 
get 


Xy —xy 
— sin(@) 7@ a yr 3/2 
y| dé xy — xy 
vidt YOR + y2)3/2 
1}d6| xy — xy 
vidt| (4? + y2)3/2 
ty — 35 


= (2 + y2p?2 
If @ is given by a function, then we can write a(x) = (x, y(x)) with @’(x) = (1, y’(x)) and 


i 


x =1, x” =0, yy? S14 yy". 


Hence, the curvature reduces to 

2h 

“eye 
where we have used the standard primed notation for derivatives with respect to x. Of course, 
this is just a special case of Exercise 1.4.6. Now we can analyze a very practical example where 
variational principles determine a shape. A good reference for problems of this type is [Tro96]. 
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Example 7.4.5 (Buckling under Compressive Loading). 

Suppose a column (which we take to be along the x-axis) has a force P applied to it parallel 
to its length at x = L, the column’s end. Daniel Bernoulli showed by experiment that the work 
per unit length required to bend the column is proportional to the square of the curvature of the 
deformed column. Suppose the plane curve that is the center curve of the column is parametrized 
by (x(t), y(t)). As above, the curvature is given by 

ee 
“Ae? 


so the work due to bending (which is also the potential energy due to strain in the column as it is 
bent) is 


s(L) 
Wena = 1 | kK ds 
0 
L NY2 
= uf Cd ae /} + y? dx 
igs Gee a) 


auf aaa 
o + yh? 


where s = f ¥1+ y’? dx gives ds = 1+ y’ dx. The proportionality constant u = EI/2 is 
determined by Young’s modulus E and the moment of inertia J. Now, there is also work done 
by the force P compressively. This is determined by how much work is necessary to restore the 
curve to its original length. In other words, 


L 
Weomp = P([ Jit y?ar-1) 
0 


where L is the original length of the column. The total potential energy of the buckled column is 


U = Whend — Weomp 


auf Oa e(f Vlt+y?dx- L) 


L y2 
(y") ( 2 ) 
= ——— -P l+y*-—1] dx. 
[ P+ ys y 


The negative sign arises in front of Weomp because the buckled column has in fact length- 
ened, thus moving molecules farther apart creating an “electrical repulsion” potential energy 
deficit. 

Equilibrium is attained when U is minimized, so this becomes a variational problem of higher 
order. Note that the endpoint conditions are y(0) = 0, y(L) = 0 (i.e., the ends are pinned). Also, 
y’(0) = 0 and y’(L) is unspecified (i.e., the “left end” maintains its shape while the compressed 
“right end” deforms). The natural boundary condition for this case may be shown to be y’(L) = 0 
(see Exercise 7.4.9). The higher-order Euler-Lagrange equation from Theorem 7.4.2 (in geometric 
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form using y(x) in place of x(r)) gives (after some simplification) 


rw 


—Spy'y”” + Py'(1+ y?y + Quy" ty) =e ty)”, 


Small deflection theory hypothesizes that the buckled column is still somewhat “close” in shape 
to the original column. This translates into the conditions that |y’| and {y”| are small enough 
so that powers higher than 1 and products of these quantities are negligible. Hence, under this 
hypothesis, the differential equation reduces to 


Py’ 4 Quy” =o 
A solution is found to be (with w* = P/(2)) 
y(x) = ey oo ihe tan(wL) + = tan(wL) cos(wx) — ie sin(wx). 
w? wo wo wo 


Recall that, to find this solution, we split the problem into two parts: finding a general solution 
for the homogeneous differential equation y’”” + wy’ = 0 and then finding a single particular 
solution to the original y’”” + w*y’ = c. The homogeneous equation has associated algebraic 
equation D? + w? D = 0 with solutions D = 0 and D = +iw. Hence the general solution for the 
homogeneous equation is 


Yh = b+ Acos(wx)+ B sin(wx). 


To find a particular solution to the non-homogeneous equation, we usually guess the nght-hand 
side. A constant is part of the homogeneous solution, so we now try the next step up, yp = Kx. 
Computing derivatives of y, and plugging into the equation gives w*K =c and we therefore 
have K = c/w* with y= (c/w*) x. The general solution to the non-homogeneous differential 
equation is then 


; c 
y(X) = yh + yp = b+ Acos(wx) + Bsin(wx) + se: 
w 


The conditions y(0) = 0 and y’(0) = 0 give b = —A and B = —c/w* while the natural boundary 
condition y”(L) = 0 gives 


A =< tan(wL) = —b. 
@ 
The final expression for y(x) is then 
y(x) = = tan(wL) + < tan(wL) cos(wx) — = sin(wx) + =a 
w ) w w 


The condition y(L) = 0 then gives the transcendental equation tan(wL) = wL which may be 
solved for w using Maple. Of course there are an infinite number of solutions corresponding to 
the modes of oscillation of the column, but the first solution is about w;L ~ 4.4934. The first 
critical force for buckling is then given by 


4.4934 \? 
Pie 2pu : 


In other words, no buckling will occur until this force is applied. Also note that, using tan(wL) = 
wL, we can simplify the expression for y(x) when buckling occurs to 


c c c Co 2s 
y(x) = 5x 5b + ZL cos(wx) — a sin(wx). 
@ w wo w 
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Similarly, P,, P; etc. may be found and the higher modes of buckling do not occur unless these 
larger forces are applied. A Maple approach is given in Subsection 7.10.2 and the first three 
buckling modes are depicted in Figure 7.8, Figure 7.9 and Figure 7.10. 


Exercise 7.4.6. When a cantilevered beam of length L is subjected to a distributed load p(x), it 
deflects to a new position described by a function y(x). Elasticity theory says that the potential 
energy of the stable (i.e., equilibrium) configuration is given by 


| 
U(y) = : sul"? = pay dx 


where ps > 0 is called the flexural rigidity determined by the beam cross section. The shape y(x) 
assumed by the deflected beam extremizes U such that 


y0)=0, y(0)=0, y(L)=0, yO(L)=0. 


The last two “natural” boundary conditions arise from the assumptions that, at the free end of the 
beam, the bending moment and shear force are zero. For p(x) = c, a constant (which occurs if 
the beam is deflecting under its own weight), find the deflected shape y(x). (Hint: y(x) is a fourth 
degree polynomial.) 

In fact, the (y”)* in the potential energy integral should be replaced by the curvature squared 
as in the buckled column example. Carry out this replacement and compute the Euler-Lagrange 
equation. Use small deflection theory to obtain the same differential equation to solve as obtained 
using (y”)*. This is why engineers often substitute (y”)? for the curvature — but there is no 
guarantee that this substitution before taking the Euler-Lagrange equation always produces the 
same result as applying small deflection theory afterwards. Now see Subsection 7.10.2 and 
Exercise 7.10.5. 


7.4.2 Higher-Order Natural Boundary Conditions 


In the derivation of the higher-order Euler-Lagrange equation in Subsection 7.4.1, we assumed that 
the perturbation 7 satisfied n(to) = n(t1) = (to) = n(t,) = 0. If we carry through the derivation 
without these simplifying assumptions, we get the following general equation: 


an ale) d (af af " Taf d (af d* (af 
(ie Ge) tas) +f le -5(Z)+5(%) oie 


As before, an extremal for the broader problem is also an extremal for the fixed endpoint (and end 
direction) problem, so it must satisfy the Euler-Lagrange equation. This means that the integral 
in the equation is zero. So we are left with 


"7 af d (af f) 
«lise — tar Ge) az): 


Now let’s look at some typical conditions which may arise. 


0= 


0= 
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Example 7.4.7. (Natural Boundary Conditions). 
The following are typical situations where natural boundary conditions arise. 


(1) Suppose x(to) = xo and x(t) = x) are fixed endpoints. This means that (fo) = 0 = n(t,) 
and the equation above reduces to 


0 0 
lend Ct) = ito) a(t), 


This holds for all 7 which vanish at the endpoints, so in particular for an y with 7(t;) 4 0 
and (to) = 0. This implies 0f/0X(t,) = 0. Similarly an n with 7(t;) = 0 and 7(to) # 0 
implies df /0X(to) = 0. We summarize by writing 


fs] fs] 
Hien Hai {$e tn) Shy y= of 
Xx Ox 


(2) Suppose x(to) = xo and X(to) = Xo. This means that n(to) = 0 and (to) = 0 and the equa- 
tion above reduces to 


fs] 0 rs] 
0= HW eE(n) nen (2 (%)- 2) 


This holds for all suitable 7, so in particular for an n with 7(t,) 4 0 and n(t;) = 0. This 
implies 0f/dX(t;) = 0. Similarly an yn with 7(t,) = 0 and n(t,) 4 0 implies 


5 (2) m= Zo 


(x(t) = xo, £(te) = 40} > {se = =0, 5 al x) (1) = ae 


We summarize by writing 


(3) Suppose (to) = Xo and X(t;) = 21. Then 


{X(fo) = Xo, X(t1) = 1) 


+\5(z 2) co) = Ete, 5 (Z) m= Ze}. (74.1) 


(4) Suppose x(to) = xo and x(t)) = 4,. Then 
‘ ; a a 
{x(to) = Xo, X(t) = 41) > F ta) =0 = a (4) = Ht) 
Ox dt 
Exercise 7.4.8. Verify the natural boundary conditions in Example 7.4.7 (3) and (4). 


Exercise 7.4.9. Suppose x(to) = xo, x(t)) = x; and x(fo) = Xo with x(t) unspecified. Show that 
the natural boundary condition which arises is 


7] 
Fi) =0 


Apply this to Example 7.4.5 to obtain the natural boundary condition y”(L) = 0 used there. 
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Exercise 7.4.10. A thin elastic rod of initial length L is clamped at one end, deflected upward 
and pinned at the other end somewhere on a vertical line x = L. If the rod is described by y(x), 
then the potential energy is proportional to the square of the curvature of y(x), 


_ L y"(xy 
un» | apron 


where 12 is a constant. The physically imposed boundary conditions are y(0) = 0, y’(0) = 0 and 
y(L) = yi. 

a) Extremize U to find a differential equation for y. 

b) Since y’(L) is unspecified, what is the condition imposed by transversality? (Hint: 
y"(L) = 0.) 

c) Assume the small deflection theory conditions that both |y’(x)| and |y”(x)| are very small 
on the interval [0, L]. Find the simplified differential equation for y under this assumption and 
solve it to find 


d.) Graph your answer using Maple by choosing various values for L and yy. 


Now we see how a differential geometric invariant such as the curvature of a curve arises in 
physical problems. Often, it is through the imposition of a physical-geometric principle (e.g., 
Bernoulli’s principle) in the calculus of variations. 


7.5 The Weierstrass F-Function 


So how in the world do we ever know if an extremal is truly a minimizing curve for a given 
integral J? Besides ad hoc means (as in Exercise 7.1.16) there are various sufficiency conditions 
for minimizers which may be used. None of them, however, seems particularly convenient for 
large problems, so we will stick with one which is rather straightforward. This condition is due 
to Weierstrass (whose work on minimal surfaces we’ve already met in Chapter 4). 

Let’s say that x = x(t) is a minimizing curve with fixed endpoints for the integral J = 
f f(t,x,*)dt and ¥ = X(t) is another curve joining the same endpoints. Denote the values 
which J has along the curves x and X by J[x] and J[X] respectively. Then, since x is minimizing 
for J, it must be the case that 


AJ © J] —Jfx] > 0. 


This condition, as it stands, is impossible to check for an extremal x against a comparison curve 
x. So, in order to compare J[X] to J[x], we compare X¥ (or more precisely, its f-value and its 
slope) to other extremals in that particular region of space. Of course, the very existence of these 
extremals is not guaranteed, so we need a definition. 


Definition 7.5.1. For the standard fixed endpoint problem with integral J, a field of extremals is 
a collection of extremails for J (i.e., solutions of the Euler-Lagrange equation) which, in a given 
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region, satisfies the condition that, for each point in the region, there is precisely one extremal of 
the collection passing through it. 


Example 7.5.2 (A Field of Extremals). 
Consider the fixed endpoint problem with x(0) = 0, x(2) = 2 and integral 


2 
1 
iad pi txttex + dt. 
0 


The Euler-Lagrange equation gives a general extremal 


r 
20S 5 tent 


which, upon applying the initial conditions, becomes x(t) = t7/2. A field of extremals may be 
defined by using one of the constants of integration in the formula for a general extremal. In 
this way we are assured of obtaining an extremal for the original J. For example, take the field 
to be 
2 
xa(t) = z +d. 

Now, if two of these extremals, x4, and x¢,, pass through a given (t,x), then #7/2+d, = 
t?/2 + do, resulting in d; = d2 and the equality of the extremals. Therefore, we really do have a 
field of extremals. 
Exercises 7.5.3. For the fixed endpoint problem of Exercise 7.1.13, 

IT , 1 2 

J= x sint+ x‘ dt 
0 2 
with x(0) = 0, x(7) = 2, find a field of extremals. 
Exercise 7.5.4. For the fixed endpoint problem of Exercise 7.1.14, 
1 
J af Ex’ 4+i? xdt 

0 
with x(0) = 1, x(1) = 4, find a field of extremals. 
Exercises 7.5.5. For the fixed endpoint problem of Exercise 7.1.15, 

x/2 

i, x= 5 dt 
0 
with x(0) = 0, x(7/2) = 1, find a field of extremals. 
Once we have a field of extremals for a problem, we can compare the field to any curve joining 


the endpoints. More precisely, given an integral J, endpoint conditions x(to) = xo, x(t;) = x; and 
a curve Xx joining the endpoints, at an arbitrary point (t, x) define p = p(t, x) to be the derivative 
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of the unique extremal in the field which passes through (t, x). Since the function p is defined 
in terms of derivatives of extremals, it called the slope function of the field. In order to compare 
values of J along curves, we make the following 


Definition 7.5.6. Given a field of extremals with slope function p = p(t, x) associated to an 
integral J = f f(t,x, x)dt, the Weierstrass E(xcess)-function is 


) 
E(t, x,X, P) = f(t, x, x) — ft, x, p)-( _ Pants, p). 


Why is this E-function defined the way it is? Here’s one rather informal explanation. Suppose 
we fix (t, x) and consider f as a function of the variable x. Taylor’s theorem for f at the point 
(t,x, p) gives 


a a? 
Flt 5) = $08. D+ PIMC. DG PPS ED 


where the second partial is evaluated at (t, x, £) between (t, x, x) and (t, x, p). If we subtract the 
first two terms of the right-hand side from both sides of the equation, we obtain 


: : a? . 
E(t,x,x, p)=(*- pat r,1). 


If we knew, for example, that the second partial of f with respect to p was always positive, then 
we could say that the E-function was always positive as well. Furthermore, if we integrate the 
E-function (evaluated on an arbitrary curve X), we see that there is a natural splitting 


— ae ~~ ~ an ) ~ 
/ E(t,3,%, p)dt = / FLD f8, p)-@- rent F, p)at 


B a A of 
=f saz.nar-f 70.%.)+@- Hy L0.%, pyar 
P 
= J[x] — K[x] 


where K [Xx] is the integral 


aa ra a 
K[x] =f He. R +O LOR, p)dt. 


Now, 
(1) if E were always nonnegative (so that { E dt > 0 as well) 


(2) and if K(x] = J[x] for the extremal of the fixed endpoint problem x, then we would 
have 


AJ = J{@|= Jie] = sta) Ke] = f ear > 0, 


and x(t) would be a minimizer for J. We will prove that (2) always holds. In fact, we will 
now show that K is path independent in the sense that any curve X(t) produces the same value 
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for K [%]. For this reason — and because David Hilbert discovered this integral — K 1s called 
Hilbert'’s Invariant Integral. 


Lemma 7.5.7. The integral 


a A Of 
K[x]= | F(t, X, p) + (% — Pyan x, p)dt 

is independent of the path x(t) joining the endpoints xo and x). 
Remark 7.5.8. Notice right away that K [x] = J[x] for an extremal x since, in this case, p = x. 

Instead of proving the lemma directly, we are going to consider the two-variable case because 
it has its uses in geometry and is rarely presented explicitly. Consider the fixed endpoint prob- 
lem with endpoint conditions x(to) = xo, y(to) = yo, x(t) = x1, y(t) = y; and integral to be 
minimized 


ty 
J[x, yJ = i F(t, x, y, x, dy) dt. 
fo 


Suppose that (x, y) = (x(t), y(t)) is an extremal for the problem. Further, assume that a field of 
extremals exists with two-variable slope function 


= (pi. pr) = (%, 5). 


The associated Weierstrass £-function is 
E(t, %, ¥.%, Vs Ps P2) =f (t, Vs XV) — F(t, J, Pi. Pr) 
~~ of. nk a OF oe 
— (% — py )——(t, X,Y, pi. p2) — (Y — P2)——(t X,Y, Pi, pr) 
OP\ Op2 


and Hilbert’s invariant integral is 


2 pa ar 
K[x, ¥] = (eee Si Ce is 
Opi 


ae Of ue x 
ly = Pa) Ut, X,Y, Pi, p2)at 
P2 
af af af of 
= sat > dy, 
fr Pl ap, Pap, Op dpa 


where we have used ¥ dt = dX and Ydt = dJ. The last equality tums K into a line integral and 
we can now use the standard fact about line integrals that they are independent of path (in a simply 
connected region) if and only if the associated vector field has a potential function. Recall that a 
vector field on an open simply connected region O C R? isa mapping F: O > R?, F(t, x, y) = 
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(A(t, x, y), B(t, x, y), C(t, x, y)) and a line integral of F along a path a(t) = (¢, x(t), y(t)) with 
differential da = (dt, dx, dy) is given by [, F-da = f Adt + Bdx + C dy. The vector field 
F has a potential function if 


F=(A, B, C) = gradgd = (dr, bx, by), 


where grad denotes the gradient and @¢, etc. denotes partial derivative with respected to the 
subscript variable. Clearly then, the condition for having a potential function is 


dA 9B dA aC aB_ ac 
ax ays ay Ox 
In particular, for K [X, y] to be path independent, we need 
Of — Piap ~ Prin) _ 9 af 
ax ~ at apy 
_ ef ef ap af apr 
~ Opiat ap? at = ap,9p at 
af — igh - Prop) a of 
ay ~ at apr 
_ ef af ap af ap 
"Op dt aps at = Api dpy at 
Of -. of 
apidy — prdx 


The third condition is a condition on the field of extremals and we will make the tacit assumption 
it always holds. Such a field of extremals is called a Mayer field. See [Sag92] for an in-depth 
discussion. Note that, in the one-variable situation, the third condition holds vacuously. In the 
following, we shall prove that the first condition holds. The second condition follows similarly. 
Also, in the proof below we will use the “hatless” notation x instead of X for the sake of simplicity. 
This does not mean however that x is an extremal, but rather that we are interested only in formal 
partial derivatives without regard to particular curves. 


Theorem 7.5.9. K[Xx, y] is independent of path. 
Proof. The proof is a tedious, but instructive exercise in the chain rule. Let’s carry out the 


differentiations listed above for (1), set the quantities equal which we want to be equal to each 
other and search for a true identity. Case (2) will follow similarly. 


Cf — pigk — przh) Ear af fap, PF | am 
ax ax 'laxap; | ap? Ox | apap. Ox 

af af ap at 2] 

35 ae Sa oP) 

P?) yap, | dp? dy | apap, | dy 
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which we set equal to 


_ of fap | #F ap 
~ Op\dt ap? at — Apap at 


which, by simplifying and isolating df/0x gives 


af af  afapy af apr af 
dx dtdp, | dp? Ot | dpidp2 Ot dxap, 
7 af 5 a’ f api ie af dp ; a7 f apy ae af dap 
2 WO. GR ers 9 A 2 oar 
dyapi apy ax apap2 ax dp; ay apap, dy 


a f af 7 af Racal (# dp dpi 
"Oxdp; ° dydp, ape 


ar yr praia 


af (apr apo apo 
LLasreg pee iciaee pein 
a (= Pip he 7) 


Now, along an extremal, p; = x and p2 = y, so differentiation produces the following general 
relationships among p; and po. 


a a a a a a 
jie ce eel pam Pe eae 


Substituting, we obtain 


af af a f vf af. af 
a Pi + p2 ag Pie 
Ox otap, Oxdp, dydp) Opi OP 0P2 


P2. 

Note that this is an expression in formal partial derivatives of f and in components of the slope 
function associated to a field of extremals. We can find another relationship between p; and p2 
by using the fact that they are associated to extremals. Namely, along an extremal, the Euler- 
Lagrange equations hold, so we have, upon replacing the usual x notation by p; and carrying out 
the chain rule, 


a (3 a Gf a° f 4 OP fe OF 3 “4 ef . af 
at api) atap, | 'axap, | ayap,  Opz?' | apiapy” ax 


which is true by the Euler-Lagrange equation. Comparing this equation with the one above, we 
see that they are identical, so we have found the required identity. Therefore, working backwards 
up the chain, we see that all the equations are true. Hence, we have proven condition (1) and 
(since (2) follows the same way) K is independent of path. O 
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Corollary 7.5.10 (The Weierstrass Condition). [fa trajectory (x, y) = (x(t), y(t)) is a member 
of a field of extremals for a fixed endpoint problem with integral J and 


E(t, x, yi xX; yy Pi; P2) = 0, 
then (x, y) is a minimizer for J. 
Proof. Although we explained earlier what the proof should be, we repeat it here for conve- 
nience and completeness. Let X denote any other curve joining the endpoints. Again, recall that 


K[(x, y)] = J[(, y)] for an extremal (x, y) since the terms involving (x — p;) and (y — pz) 
vanish. 


AJ = J[&,¥)) — JIG, y)] 
= J[@, ¥)] — KI@, y)] 
= J[@,9)] — KIC. ¥)] 
since K is invariant, so 
AJ= / E(t, x, y,x,dy, pi, pr)dt > 0 


since E > 0. Hence J[(x, ¥)] = J[(x, y)]. U 


Exercise 7.5.11. In the case of one variable x, show that K [x] is independent of path. Then show 
that if x is a member of a field of extremals and E(t, x, x, p) > 0, then x is a minimizer. 


Example 7.5.12 (Weierstrass E). 
Consider the fixed endpoint problem with x(0) = 0, x(2) = 2 and integral 


2 

1 

z=} =P 4xitxtidt. 
a2 


An extremal is x(t) = t*/2 and a field of extremals may be defined by xg(t) = e + d. Let’s 
compute the Weierstrass E-function. 


1, 1 
E(t, x, x, p)= ia P+ xk tata 5p) —xp—x—p—(—pp+xt) 
1 1 : 
=o ar PAP 
1, ] 
=r bah ae? 
1 
age — 2% p+ p’] 
1 2 
=56 x — p) 
> 0. 


Hence x(t) = t?/2 is a minimizer for J. 
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Example 7.5.13 (Weierstrass E for the Brachistochrone). 
Assume that the brachistochrone problem has a field of extremals. Then, the E-function has the 
form 


Vie flap? es Pp 
Ju vu Ju(l + p*) 
_ vi+u? Jl + p?—(1+ p*)— pu + p? 
Vu(l + p*) 

_— vitu? Jit p?—1— pu 
Vu(l + p?) 

_ 1,4 %G, P=, ¥)- Cp) 
Vu(l + p*) 

where | | denotes length and - denotes dot product. We then have 

_ 10,4), p)|— G,#)-C, p) > 0 
Vu(l + p’) 


by the Schwarz inequality. Hence, the cycloid is a true minimizer for the brachistochrone problem. 
Compare [HK95] and, especially, [Law96]. 


E (u’ — 


E 


Exercise 7.5.14. We have seen that straight lines are extremals for the arclength integral 


J= / Vx7(t) + p(t) dt. 
Clearly, by simply translating the line up and down we obtain a field of extremals. Compute 
the two-variable Weierstrass E-function and show that straight lines really do give the shortest 
distance between two points in the plane. Hint: follow the brachistochrone example. 
Exercise 7.5.15. Show that the Weierstrass E-function is always nonnegative for the extremal 
of the surface of revolution of least area (see Example 7.3.7 and Exercise 7.3.8). The reason that 
the catenoid does not always minimize area is that, beyond a certain point, it no longer sits in a 
field of extremals. 
Exerciset 7.5.16. For the fixed endpoint problem of Exercise 7.1.13, 


3 1 
ra x sint+ —x°dt 
5 2 


with x(0) = 0, x(7) = 2, compute the Weierstrass E-function to show that the extremal is a 
minimizer. 


Exercise 7.5.17. For the fixed endpoint problem of Exercise 7.1.14, 


1 
J =p xx? 4x7 xdt 
0 
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with x(0) = 1, x(1) = 4, compute the Weierstrass E-function to show that the extremal is a 
minimizer. 


Exercise# 7.5.18. For the fixed endpoint problem of Exercise 7.1.15, 


n/2 
/ x? — x? dt 
0 


with x(0) = 0, x(27/2) = 1, compute the Weierstrass E-function to show that the extremal is a 
minimizer. 


Exercise 7.5.19. For the fixed endpoint problem, 


I 
[e-a 
0 


with x(0) = 0, x(1) = 1, compute the Weierstrass E-function to show that the extremal is a 
maximizer. 


The Weierstrass E-function provides a powerful tool for identifying minimizing curves. Nev- 
ertheless, it does not provide a panacea. For one thing, we are usually able to show E > 0 only 
when some sort of special algebraic structure is present. In particular, we usually prove E > 0 for 
an arbitrary variable p as opposed to actually using specific properties of a particular slope func- 
tion p. This is so simply because information about p rarely helps. Therefore, while it may not 
be hard to show E nonnegative for relatively simple problems, for more complicated problems, 
the algebraic complexity goes up exponentially. Secondly, the existence of a field of extremals is 
by no means something which is guaranteed. Indeed, there is a Jacobi differential equation akin 
to that of Chapter 5 and a notion of conjugate point which detects whether an extremal sits in 
a field. Specialized to geodesics, this explains why going past a conjugate point on a geodesic 
implies that the geodesic is not minimizing. This general Jacobi condition is too complicated 
to be presented fully in our naive context, but the interested reader should see [Pin93] for an 
elementary (and brief) discussion, as well as [Bli46, Ewi85, Sag92]. We will only present a sketch 
of the situation for geodesics in the following example. For details, see [Hsi81]. 


Example 7.5.20 (The Jacobi Equation Revisited). 
Suppose that the v-parameter curve a(v) = x(0, v) is a shortest distance geodesic joining two 
fixed endpoints and the u-parameter curves x(u, vp) are geodesics orthogonal to a. In fact, it can 
be shown that such a patch always exists in a region near @ and, in this case, the parameters u 
and v are called Fermi coordinates. By taking v to be the a arclength parameter, we obtain the 
metric coefficients E = 1, F =Q and G > 0 with 


GO, v) = 1 and G, (0, v) = 0. 


The first equality follows from the fact that the geodesic a = x(0, v) is parametrized by arclength, 
so G(0, v) = xX, - xX, =a’ - a’ = 1, The second equality follows from the formula developed for 
geodesic curvature in Theorem 5.4.12. We have k, = 0 since @ is a geodesic and the angle from 
a’ to x, as well as the metric coefficient E = | are constants. Hence, G,,(0, v) = 0. The equalities 
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above are saying that, along the curve a, the metric looks like a Euclidean metric up to first-order 
derivatives. This allows us to carry out a “second variation” below as if we were in the plane. 


Exercise 7.5.21. Use the equalities G(0, v) = 1 and G,(0, v) = 0 to show that, along the curve 
a = x(0, v), the Gauss curvature of the surface is given by K = —G,,,(0, v)/2. Hint: use Theo- 
rem 3.4.1. 


Now let’s choose a variation of a of the form x(en(v), v) where € is small and n(a) = 0, 
n(b) = 0 (and a(a), a(b) are fixed endpoints on a). The arclength integral is then 


b 
J@Q= / fen? + G(en(v), v)dv 
a 
with €-derivative equal to 


_ dj 
~ de 


4 en” + Gun 
Py ees aa ae ea 
a 2/fe2n! + G(en(v),v) — le=0 
since we assume a to be shortest distance. Also, since a gives a minimum for arclength, the 
second derivative of the arclength function (at ¢ = 0) must be nonnegative. Therefore, we obtain 
aI /de? = 
[ [2n'* + Guan] e?n’? + G — Len” + Gunl[2en” + Gunl/2Ven? +G 
v 
: en? +G 
- i 2[2n"* + Guat? Jle?n”” + GI — Ren” + Gun 5 
= Z 2(€2n? + G)3/2 . 


e=0 


Upon evaluation at « = 0, we get 


aj ie 2[2n? + Guy (0, v)n?] GO, v) — G2(0, v)n? : 
oe = Vv 
de? 7 2G3/2(0, v) 


€=0 


b 
= / 2n? + Gyy(0, v)n? dv 


since G(0, v) = 1 andG, (0, v) = 0. Ifa truly is a shortest length geodesic joining the endpoints, 
then, as a minimum of the function J(e€), the integral above (i.e., the second derivative of J) 
must be nonnegative for all choices of n with n(a) = 0 = n(b). But the integrand of /’s second 
derivative is a function f(v, 7, 7’) and so an 7 giving the minimum of the integral (which we want 
to be zero) must satisfy the Euler-Lagrange equation for this integral. Thus we arrive at Jacobi’s 
secondary variational problem. Namely, we must calculate the Euler-Lagrange equation of /’s 
second derivative. 


d ’ 
2Gyu(0, v)n- at) =0 


n Gaul, v) 
n ee 
n’ + K(0,v)n =0 


=0 


since Gy,(0, v) = 2K by Exercise 7.5.21 above. Of course this is the Jacobi equation of Chapter 6. 
Now, here’s the connection between solutions of the Jacobi equation and geodesics of shortest 
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length. Recall that a point c € [a, b] is conjugate to a if there is a (not identically zero) solution 
of the Jacobi equation 7 such that n(a) = 0 and n(c) = 0. (We assume c 1s the first such point in 
the interval [a, b].) We shall show that the existence of a conjugate point implies that there is an 
i) for which the second derivative is negative; 

ay 

de e=0 
By the discussion above, this means that a could not possibly be a minimum for the arclength 
function. In other words, a geodesic is shortest length only up to the first conjugate point. 
(Technically speaking, we can actually only say that, for all curves sufficiently close to the 
geodesic, the geodesic is shortest length only up to the first conjugate point.) 

To prove that a solution 7 to Jacobi’s equation with n(c) = 0 gives a negative second derivative, 
we must use a technical tool from the calculus of variations called the Weierstrass-Erdmann corner 
condition (see [Pin93] or [Sag92] for example). This condition says that, along a piecewise smooth 
extremal x(f) (i.e., an extremal which is smooth except at a finite number of corners) for an integral 
J =f f(t,x, x) dt, the partial derivative 8f/8% must be continuous at a corner T. That is, if we 
take the limit of partials on both sides of the corner as we approach the corner T, we must get the 
same answer. With this in mind, let 7 be a nonzero solution to the Jacobi equation n” + K n =0 
(along w) with n(c) = 0 and n(p) # 0 for all p € (a, c). Define 


<0. 


mp) for p € [a,c] 
E(p) = 
0 for p € [c, b]. 
For this variation € (and using K = —G,,,/2), the second derivative is 
ad 


de? 


b 
=2/ &?_ K& dy 
0 a 


b b 
=2f edu ~2f Kédv 
a a 


c ¢ 
| n?dv-2f K n° dv 
a a 


since € = n on [a,c] and € = 0 otherwise. Integrating the first term by parts gives 
ad 
de? 


€= 


(a c 
= 2nn' | - 2f nn dv —- 2f K n° dv 
e=0 a a 


a i nln" + K nldv 
a 
ap, 


since 7 is zero at a and c and since n satisfies the Jacobi equation. So, € gives a value of zero 
for the second derivative. Could & in fact be a minimizing curve for the second derivative? If so, 
then the Weierstrass-Erdmann corner condition would have to be satisfied. On one side of the 
comer c, the 7’ derivative of J’s second derivative’s integrand n? — K 7? is 2n’ while on the 
other side (where & = 0), the 7’ derivative is zero. Hence, we obtain n’(c) = 0. But we already 
know that 7 is a solution to a second-order linear differential equation (the Jacobi equation) and 
n(c) = 0. By the uniqueness of solutions of such differential equations with given initial data, it 
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must be the case that 7 = 0 identically. This contradicts our assumption about 7. Therefore, the 
Weierstrass-Erdmann condition cannot hold for 7 and, consequently, it cannot be a minimum for 
the second derivative. Since n gives a value of zero for the second derivative, there must exist a 
piecewise smooth function 7 which makes the second derivative negative, 


d’J 


7a| [a] <9. 
de? €=0 


In fact, any corners of such a function may be “rounded off” to produce a smooth such 4. The 
existence of such an 77 then implies that the original geodesic @ is not a minimum of arclength for 
any point past c. This example illustrates the importance of the Jacobi equation in determining 
whether or not extremals actually minimize. Also, from the definition of the patch x(u, v) itself, 
we see that the Jacobi equation is intimately connected with the existence of fields of extremals. 


Besides these questions about fields of extremals, the E-function and conjugate points, there 
1s a question which we have avoided up to now. While the Euler-Lagrange equations provide 
a necessary condition for an extremum, there may in fact be no solution to a given variational 
problem. For instance, if we consider the plane with the origin removed and ask for the curve 
of shortest length joining (1, 0) and (—1, 0), then there is no solution because the straight line 
extremal joining the endpoints passes through (0, 0) and, therefore, is not allowed. In the calculus 
of variations, each problem must be analyzed separately to determine if a solution exists. Of 
course, the question of what types of functions may be allowed as solutions arises as well. 
Usually, as in Example 7.5.20 above, an admissible class of functions for variational problems 
is the class of piecewise smooth functions. These are continuous functions whose derivative(s) 
exist and are continuous everywhere except at a finite set of points where corners occur. The 
following exercise indicates some of the subtleties of the subject. 


Exercise 7.5.22. For the fixed endpoint problem with x(0) = 1, x(1) = 0 and 
: l 
4 : 
stl = | x"+xx+-dt, 
0 2 


a.) Show that the only extremal is x(t) = 0, so that there is no minimizer. 
b.) Show that J[x] > 0. Hint: show that J[x] = f> x‘ dt. Use i dt = dx. 
c.) Show J[x] gets arbitrarily close to 0. Hint: consider the piecewise linear function defined by 


« l—et O<r<} 
x = 
0 | 


and compute J [x] for these functions. 


There are various methods for analyzing whether extremals are minimizers, many of which 
can be found in texts such as [Bli46, Sag92, Ewi85] for example. In particular there is a general 
approach which depends on the so-called second variation and, while we do not have space to 
discuss this in generality, we point out that Example 7.5.20 above exemplifies this method. See 
[dC76] for a good discussion of the second variation as applied to geodesics and [Opr00] for an 
application of the second variation to detecting whether minimal surfaces actually minimize area 
(see Schwarz’s theorem). 
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7.6 Problems with Constraints 


7.6.1 Integral Constraints 


Often, variational problems come with extra side conditions which must also be satisfied. For 
example, we saw in Chapter 4 that we can minimize the surface area of a compact surface 
enclosing a region of space subject to the constraint that the enclosed volume remain constant. 
Recall that the minimizer surface turned out to be a sphere. (For a variation of this constrained 
problem designed to describe the shape of a mylar balloon, see [Pau94].) In fact, the most ancient 
of all these types of problems is the following two-dimensional version: given a closed curve 
of fixed arclength, what is the shape the curve should assume to maximize the enclosed area? 
Because of this, constrained variational problems are sometimes called isoperimetric (1.€., same 
perimeter) problems. In this section we shall see what modifications are necessary in our previous 
work to handle variational problems with constraints. The standard problem shall be, 


ty 
Minimize J = / f(t, x. x)dt 
to 
subject to the endpoint conditions x(t)) = xo, x(t;) = x; and the requirement that 


fy 
r= g(t,x,x)dt=c 


to 
where c is a constant. 

We will now derive a form of the Euler-Lagrange equation which is a necessary condition for 
the solution of the constrained problem. Just as before, assume x = x(t) is a minimizer for the 
problem and take a variation ® = x + € (an + b&) where n(to) = 0 = n(t;) and (to) = 0 = &(t). 
We take two “perturbations” n and € because taking only one would not allow us to vary J while 
holding 7 constant. By taking 7 and &, we can vary J while offsetting the effects of one 
perturbation with the other in /. The usual Euler-Lagrange argument gives 


_ ft afd (af 
ao Lenten (2-58) # 


Now, however, the usual argument must be modified because an + bé is not arbitrary. The 
requirement / = c puts restrictions on a and b. If we carry through the argument above for /, we 


get 
d {0g 
Fel” for (3s ~ ae (as) # 


where the derivative is zero since J = c is a constant. But x is not an extremal for J, so the 
Euler-Lagrange equation with g does not hold. Instead, we have 


rie: rie 
[anton (F- 5 (#)) a=0=[ (an + bE) (2 - = (3)) dt 


which produces 


0= — 


0= 
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Upon rearranging we obtain 
Se (#- a (#)) a _ Son (#-a(%)) 


he(Eoa(H) a” fn (ESS) a 


The left-hand side is a function of € while the right is a function of 7, so the only way for 
these expressions to be identical is for both expressions to be equal to the same constant A (the 
Lagrange multiplier). Simplifying the expression 


fen (-$(8)) a 


ta Eoal) 


te PAG = Te) (OE AON x. 
fol ax | aE )]a=0. 


Now the usual argument may be continued. Because 7 is arbitrary, the previous equation can hold 
only if the term in brackets vanishes. We thus obtain the following Euler-Lagrange necessary 
condition for the constrained problem. 


gives the equation 


Theorem 7.6.1. [fx = x(t) is a solution to the standard constrained problem, then 


Aras) 4 (AL=48)) 9 
ax dt ax oes 


Example 7.6.2 (Bending Energy and Euler’s Spiral). 

As we saw in Section 7.4, the bending energy of a plane curve is defined to be the integral of 
squared curvature over the length of the curve, { «?(s) ds. Recall from Chapter | that the curvature 
of a plane curve also is given by 6 = d@/ds, where 0(s) is the angle the unit tangent T(s) makes 
with the x-axis. As we shall see when we discuss Hamilton’s principle, it is not unreasonable to 
assume that a wire will try to assume a shape which minimizes bending energy. (For a biological 
application of this idea see [Can70] as well as [DH76a] and [DH76b].) Therefore, we can ask the 
following question. What shape will a wire take if the total turning of its tangent is fixed and the 
turning is zero at the endpoints? As a constrained variational problem, we write 


1 1 

Minimize J ah 1 92 ds subject to I = Ods = 2 

0 2 0 6 

with 6(0) = 0, 6(1) = 0. The factor 1/2 in the first integral is only put in to make calculations 
cleaner. Minimizing bending energy is clearly equivalent to minimizing one-half of bending 
energy. Similarly, the 1/6 value of the second integral is chosen only to avoid fractions later 
on. This said, the Euler-Lagrange equation for the constrained problem is —A — 46= = 0 with 
solution 


ee 
O(s) = ree +s). 
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Figure 7.3. Euler’s spiral (the spiral of Commu) 


An application of the constraint gives 


Hence, A = 2 and 6(s) = —s? +. Further, a field of extremals is given by 0g = —s* +5 + 
d and the Weierstrass E-function is E = 1/2(6 — p)* > 0. Hence, @ is a true minimizer for 
bending energy subject to the constraint. We have met the curve corresponding to 6 previously in 
Exercise 1.5.14. Recall that a plane curve £(s) with curvature « = d@/ds may be reconstructed 


by the formula 
B(s) = (/ cos(o(u))du, | sin(@(u)) du). 


In the case at hand, 6(s) = —s* +s and we have a form of Euler’s spiral (see Figure 7.3). 


Remark 7.6.3. The reader should compare and contrast the preceding example with the classical 
problem of the elastic rod (see [Sag92]) of fixed length L. Suppose a rod is clamped at points 
(a, ya) and (b, y,) at fixed angles 6, and 6, respectively. The shape the rod assumes at equilibrium 
is determined by minimizing the potential energy of the rod and this quantity is proportional to 
f «2(s) ds. The constraints of the problem are determined by the fixed angles. Namely, projections 
onto the x and y axes give, fF cos@ ds = b—aand sin@ ds = yp — yq. Therefore, using two 
Lagrange multipliers, we extremize the integral 


£ /do\? 
J= —} —A,cos@ —d2siné@é ds 
0 ds 
to obtain the Euler-Lagrange equation 


: d dé 
A; sin@ — Az cos @ — ae (22) = 0. 
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Exercise 7.6.4. Integrate the Euler-Lagrange equation with respect to s to obtain 


Explain why all inflection points of the elastic rod lie on a straight line. 


If the line determined in Exercise 7.6.4, 4 yt By +C =0, is rotated to become the new y 
axis, then the shape of the elastic rod has the expression 
x7/2+4a 


»~ | Jim@ifi tay 


Exercises 7.6.5. Solve the constrained problem 
My 7 
Minimize J = i; al +xxdt subject to I =) xdt = — 
0 0 
with endpoint conditions x(0) = 0 and x(1) = 0. 
Exercise 7.6.6. Solve the constrained problem 
i l ! 
Minimize J =) xt — oe dt subject to I -/ xdt=1 
0 0 
with endpoint conditions x(0) = 0 and x(1) = 1. 


Exercise 7.6.7. In the notation of this section, show that (under the assumption that the Lagrange 
multiplier A is nonzero) a minimizer x(t) for J subject to fixed / is a minimizer for / subject to 
fixed J. 


Exercise 7.6.8. From Exercise |.1.22, we know that a freely hanging chain of fixed length hangs 
in the shape of a catenary y = c cos(x/c + d). Show that this follows not only by the force 
diagram of Exercise 1.1.22, but by energy considerations as well. Namely, the potential energy 
of the hanging chain is proportional to f°" yl + y’? dx and equilibrium position of the chain is 
assumed when potential energy is minimized. The problem is constrained, however, by the fixed 
length of the chain L = i i /1+ y? dx. Therefore, the equilibrium shape y will be attained 
when the following integral is extremized: 


y= fo yfity?—ayit ye. 
x 


0 


Extremize this integral to obtain a catenary. Hint: the integrand is independent of x. 


Example 7.6.9 (The Isoperimetric Problem). 

Suppose we have a closed curve of fixed length L = f x? + y? dt and we wish to enclose 
the maximum amount of area. Green’s theorem allows us to express area as a line integral 
{(1/2)(—yx + xj) dt so that the variational problem becomes: 


Maximize J= f (1/2\-yé +25) avi + Fed 
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The two-variable Euler-Lagrange equations are 


y ad y AX x6 x dfy Ay esiG 
2 dt\ 2 fezzyzp 2 dN 2 feeagey 


which may be rewritten as 


df, hi ; d , 4s ; 
— ————. | = 0, —{-x + —_—— ] = 0. 
dt y [x2 4 jy? dt /x2 + y2 
We then have 
Ae ay 


—S=— = and —x+——— =-D 
aide Ey? 


Ve = 
Vx? + y? 
and a little algebra produces 


(x -— DP +(y-—CY =H’, 


acircle ofradiusr = A centered at(D, C). Because, foracircle,L = 2mr,wehaver =A = L/2n 
with area A = L*/(47r). The isoperimetric inequality Theorem 1.6.1 then verifies that this circle 
is the shape of fixed length L which maximizes the enclosed area. Once again we see that the 
calculus of variations has the power to endow geometry with a motivating “principle”. 


Exercise 7.6.10. Show that the two-variable Weierstrass E-function is always non-positive for 
this problem. Thus, assuming it may be embedded in a field of extremals, the circle provides a 
maximum for the integral J and is, therefore, the curve of fixed length which encloses the largest 
area. 


Exercise 7.6.11. Show by variational methods that the circle also solves the problem of finding 
the curve of shortest length which encloses a fixed area. Then show the same thing by the following 
argument. Let a be a closed curve of fixed area A and arclength L. Since the circle solves the 
problem of finding a curve of fixed length having maximum area, we must have A < L’/4r. 
Take a circle of area A and show by simple algebra that the arclength of this circle is smaller than 
L. Hence, for given area A, the circle has smallest arclength. 


Exercise 7.6.12. The circle is the solution to the isoperimetric problem and the thing which 
characterizes the circle as a plane curve is its constant curvature (see Theorem 1.3.21). Suppose 
now we want to solve the isoperimetric problem on a surface M. That is, we want to find a closed 
curve C of fixed arclength which maximizes the surface area of the region of M which it encloses. 
Show that such a curve must have constant geodesic curvature. Hints: using Exercise 4.4.2 and 
Exercise 5.1.4, show that the constrained problem of maximizing surface area while keeping 
arclength fixed gives {.(1 —A«g)(U x T)- V ds = 0 for all (appropriate) vector fields V. By 
the usual arguments, kK, = 1/A. 


7.6.2 Holonomic Constraints 


Integral constraints such as those above are not the only types of constraints which arise in 
variational problems. We shall consider one more class of constraints which pertain to geometry 
and mechanics, the so-called holonomic constraints. Consider the following situation: a marble 
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is placed inside a frictionless bowl in the shape of a hemisphere and the marble is given a 
horizontal push. What is then the trajectory of the marble inside the bowl? There are two forces 
to be dealt with here, gravity and the force which constrains the marble to remain in the bowl. 
It is a principle due to D’Alembert that the force of constraint is a normal force to the bowl 
and so does no work on the marble. See [Arn78] for a discussion of holonomic constraints 
and D’Alembert’s principle. What this all means is this. In order to discover the equations of 
motion of the marble, it suffices to consider the gravitational potential V(x, y, z) = mgz and 
kinetic energy T = (1/2)m(x? + y? + 2”) restricted to the surface itself. In other words, because 
a parametrization of the sphere of radius R is given by 


x(u,v) =(Rcosu cosv, Rsinu cosv, Rsinv), 


we obtain the restricted forms of V and T, 
1 
V =mgRsinv, T= 5m R(cos” vi? +0’). 


To show the latter, we can either compute x, y and z directly (by the chain rule) in terms 
of u and v or we can reason as follows. The path of the marble is given in three space by 
a curve a(t) = (x(t), y(t), z(t) with velocity vector a’(t) = (*, y, z). The kinetic energy may 
therefore be written T = (1/2)|a’|*. As we have seen many times before, the chain rule gives 
a’ =X, u +x, v and, using F = 0, we obtain the kinetic energy by 


la’? =x, -x,wW+x,-x, 0" 
jr’ P =EW+G0 


1 12 .2 
Page + G0"). 


Of course, for the R-sphere, E = cos* v and G = 1, so we obtain the expression for T above. 
What we have just done embodies the essence of the term “holonomic constraints”. In fact, we 
may say that constraints are holonomic when they arise by simply inserting the parametrization 
of the constraint surface into the usual mechanical quantities such as kinetic and potential energy. 
By the discussion above, the action integral which is relevant to the marble in the hemispherical 
bowl is 


l 
J= if 5m R°(cos” vie + v) —mgRsinv dt. 


The Euler-Lagrange equations for this two-variable problem are 


2 


mR’ cos? vite = 


—mgRcosv — mR? sinvcos vw? — mR?i = 0, 


or, more simply, 

: c sf & : <2 
t Uhm rca and Mehta en COea = 0. 
Here we have used the fact that the integrand is independent of u. When we use Maple later 
(see Subsection 7.10.4), it will be unnecessary to do this. These are then the equations of motion 
of the marble. The trajectory corresponding to these equations (and plotted by Maple with 
g =m = R = 1) is shown in Figure 7.4. 
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Figure 7.4. Marble in a hemispherical bow! 


Exercise* 7.6.13. Find the equations of motion of a marble in a paraboloid shaped bowl. Take 
x(u,v) =(ucosv. “sinv, u?) to be the paraboloid’s parametrization. 


We can see the influence of geometric ideas on physics if we consider the case of a particle 
constrained to a surface M : x(u, v) with zero potential, V = 0. In physics it would be said that the 
particle moves inertially on the surface — that is, only under the influence of the constraint forces. 
According to D’Alembert’s principle, these forces are normal to the surface, so the particle’s 
acceleration is likewise in the normal direction. In geometry we recognize the path of such a 
particle as a geodesic. Indeed, we have (taking, for simplicity, an orthogonal patch) 


Theorem 7.6.14. /f the potential is zero, then, according to Hamilton's principle, constrained 
particles move along geodesics. 


Proof. The kinetic energy is given by 
T baat l ne | E +2 G +2, 
— zie = zn u+Gu") 
with action J = / T dt. The Euler-Lagrange equations (with m divided out) for this action are 
E, +2 Gy >) d . Ey hy! G, 2 d 
lad —p — —(En) = — nH" —j = — by) = 0. 
ria 4 5 v Tt u)=0 and 5H + 5 i We”) 
The t-derivatives are 
a . ae ‘i 
Wee) = (E,u + Ey,t)a+ Ett 


d 
Wa") = (GG, + Gyi)b + Gi 


sO, rearranging and simplifying, we obtain 
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and 
» Evin Gu... | Guy 
-— — —v =0. 
v 3G" + Gu ae! 
By Hamilton’s principle, these are the equations of motion of the particle. By our work in 
Chapter 5, they are also the equations describing a geodesic. O 


Exercise 7.6.15. Suppose, out in space where gravity is zero, a marble is rolled inside a frictionless 
cylinder x(u, v) = (cos v, sin v, u) with kinetic energy T. By extremizing the action 


[ra 


constrained to the cylinder, find the equations of motion of the marble for the following three 
cases of initial conditions: 


(a) u(0) = 1, u(0) = 0, v(0) = 0 and v(0) = 1. 
(b) u(0) = 1, u(0) = 1, v(0) = 0 and v(0) = 1. 


(c) u(0) = 1, 4(0) = 1, v(0) = 0 and 0(0) = 0. 


Graph your results using Maple. Show the trajectories on the cylinder. Because the potential 
V = 0, these trajectories are geodesics on the cylinder. 


Exercise 7.6.16. Suppose the constraint surface is a surface of revolution x(u, v) = 
(h(u)cosv, h(u)sinv, g(u)). Take the v-Euler-Lagrange equation of J = fT dt (as above) 
to get h*v = b constant. Also take the first integral (see Exercise 7.1.10) to see that an extremal 
a has constant speed. Then use an approach similar to that following Proposition 5.2.16 to derive 
the Clairaut relation A(u) sing = c. 


Exercise# 7.6.17. Show that the two-variable Weierstrass E-function for the action J = f T dt 
is always nonnegative. Therefore, whether or not a geodesic minimizes the action is simply a 
question about the existence of a field of extremals. In fact, conjugate points (see Theorem 6.8.10) 
are exactly those points which tell us that a field of geodesics (i.e., extremals) ends, so geodesics 
no longer minimize arclength past conjugate points. 


Exercise 7.6.18. If V = V(u, v) is a nonzero potential for the problem of a particle constrained 
to a surface M (with orthogonal patch), show that the equations of motion for the particle (with, 
say, m = 1) are 


E G V, 
Eu 2, Ev Guia Ww 
aT a ae E 

and 
V. 
< Biga Gig . Case Vu 
Yan eto ro 


Recalling how the geodesic equations arose, we may write these equations as 


“ i 
Qian = Vor? = —grad,, V, 
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where the M-gradient of V is taken with respect to the metric of the M. In order to understand 
this, we must think of what the gradient really means. In Euclidean space we have grad V -e; = 
dV /dx;. The general definition requires that grad,, V satisfies the following condition. 


grady V-x, =dV(x,), grad, V-x, = dV(x,), 


where dV = V, du + V, dv and du, dv are 1-forms dual to x,, x, respectively (see Section 8.6). 
This means du(x,) = 1, du(x,) = 0 and similarly for dv. If we write grady V =ax,+bx), 
then show that a = V,/E and b= V,/G. 

Thus, the M-gradient of the potential is the obstruction to the particle travelling along a 
geodesic. Of course, we usually write —grad V = F for the force in a conservative system, so we 
see that, from a physical viewpoint, forces are the things which cause particles to deviate from 
geodesic paths. This is akin to Newton’s first law. 


7.6.3 Differential Equation Constraints 


Now suppose that we want to extremize an integral 


q 
Sf F(t, Xp, X2, 0 Xe XH, - 0 Ky) at 
fo 


where the x;(t) are k functions of t such that an extremal also satisfies the set of N < k differential 
equations 


G(X), X2,- 0 Key X1, X2, 0 Xe) =O, 2.6, Gy (xy, X20, 0 Xe Xt, Xa, Xe) =O. 


The same derivation as usual (with variations x7 = x; + €n;) gives 
; na of . Gs) of , 
o=7= | (Gem + ein +--+ mt Som) at 
fo ! Xk Xk 


Now, the n; have a mutual dependence coming from the following equations derived from differ- 
entiating the differential equations G;. Because the variations also must satisfy the differential 
equation constraints, differentiating each G ; with respect to € gives (ate = 0) 
aG; aG; . dG; aG;. 
=m + thn to + me + Ok = 0. 
Ox, Oxy OX, OX’ 


Now multiply the jth equation by an unspecified function 2 ;(t) for j = 1,..., N and add these 
new equations to the integrand of J’(0) to get 


N 


"(| af aG; Of <n 4G; 
i Ox] 2, "ax; OXK 2, ’ Ox; 


Write F = f+ eer 4;(t)G ; and do integration by parts as usual to make the integral 


[ afd (aF\|) | [ard oF) ) ar =o 
» \Laxn dt Va) |™ ax dt Lai, |) 


7.6. Problems with Constraints 355 


Unfortunately, the 7; are not independent because they are related by the differentiation of the 
G,; above, so we cannot say each bracket is zero at this point. However, if the dG;/de are 


independent as functions, then because there are NV equations in the & unknowns 7;,..., 7, then 
we expect k — N independent n’s. Suppose that we order the variables so that 7;,..., 71 depend 
ON N41, +++5 Tk 


Now let j\(t),..., H(t) be any set of functions so that the first N brackets in the integral 
vanish. Then the integral becomes 


a oF d OF 4 4 OF d ~~} dt=0 
t) \LOXn41 at \OXn41 a ax, dt \a%) |™ ry 


But now the nn41,...,7 are independent, so our usual argument says that each bracket 


vanishes; 
OF d (OF ) 0 
ax; dt \ax;J} 
for j = N+1,...,k. Also, the choice of the 2 ;(t) gives 
OF d (dF\| _ 0 
ax; dt \ax;)}— 
for j = 1,..., N as well. Therefore, we have the usual system of Euler-Lagrange equations. 


Theorem 7.6.19. Extremals for an integral 


ty 
i=) S(t, X15 X2, 0. +) Xe X1, Koy... My) at 
tg 


(where the x;(t) are k functions of t) which also satisfy the set of N < k differential equations 
Gi(%1, X2,--., Xe, 1, X02, MDS 0,..., Gn (x1, Xo, 2. +s Xk, X1, X2,-.., Xe) =O, 


are given by solutions of 


OF d (aF 
pelea LS PE a es ee 
ax; a (ar) : 


which also solve the Gj, where F = f + pata Uj (t)G;. 


Example 7.6.20 (Second-Order Integrals). 
Let’s consider the situation of a second-order integral 


J= 3 f(t,x, x, xX)dt = i S(t, x,y, y)dt 


to fo 


with an extra differential equation 
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Take F = f(t, x, y, y) + u(t)(y — x) and form the Euler-Lagrange equations 


ar od (aF\ _y OF od FY 20 
ax at\ax) ° dy dt\ay) ' 
afd af d (af 
— — —(-p) =0, = —-—(—)=0, 
ae ae oe = (#) 
af du _ af df Ot \_% 
ax dt ay adt\ay}/ 


By eliminating 2. between the two equations, we obtain 
af od (a d* (a 
af moe af + — oF =0 
dx dt \dy dt*® \ ay 


afd (af a rary... 
a (ae) + ga (ge) 72 


And this is the usual second-order Euler-Lagrange equation. 


7.7 Further Applications to Geometry and Mechanics 


In this section we will look at geodesics as potential minimizers of arclength. Furthermore, we 
will generalize Theorem 7.6.14 to say that the motions of particles are always geodesics — with 
respect to a metric on the surface conformal to the original one. We will also see how the surfaces 
of Delaunay of Section 3.6 arise from a variational principle. 

We have said all along that, philosophically, geodesics should be curves of shortest arclength. 
We took the definition of geodesic to be a curve whose tangential component of acceleration 
vanishes, however, because this characterized lines in 3-space and seemed a more directly 
verifiable condition. We learned in Theorem 6.8.10 (and Example 7.5.20) that geodesics do not 
always minimize arclength, but we may still ask if they extremize arclength. As usual, we assume 
for convenience that the metric has F = 0. 


Theorem 7.7.1. Let M: x(u,v) be a surface with metric E and G. Then, extremals for the 
arclength integral 


J = [ Vee scar 
are geodesics on the surface M. 


Proof. We calculate the two-variable Euler-Lagrange equations to be 


E,wW+G,v 4d Eu = 
2WE+Ge dt (ssaz)- 
E,w+G,0 4d Gi - 
Saas a Vereen) > 
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Now we introduce the arclength parameter s(t) since we know geodesics must be parametrized 
to have constant speed. This means that we have the relations 


d d JER GE 
een sea eced pd Legeae 
dg oe ce 
, du dudadt u , du dvdt v 


“ds dtds JEPaGE jj ds dids JEPaOR 
+ 


Putting these into the equations above (and dividing out by the factor / E u* + G v2) gives 


E G d 

ae + eae — =(Eu') = 0 
E, 42 Gy y” 

= ~ —(Gv') =0. 
5 ue + —— 5 “( N= 


Carrying out the differentiation and simplifying produces 


Eu 72 Ey, u’ v’ Gy 12 


uae tla eee ere | 
u" + IE uot E" IE v 
and 
" Ey 12 G, te vi 72 
= = =0 
ge lage OG 
These are the geodesic equations. O 


Remark 7.7.2. The Weierstrass F-function for the arclength integral is computed to be 


VE i? + Gi2,/E p? + G p} — (Eup; + Gipy) 
VEW+GU 


The fact that E > 0 follows because the metric E, G gives an inner product to each tangent plane 
defined by 


E(t,u,v,u,v, pi, p2) = 


vow= Evy, + Gr2u2 


and the Schwarz inequality |v o w| < |v||w| holds for all inner products. Therefore, once again 
we see that the shortest distance property of existing geodesics depends entirely on the existence 
of a field of extremals. This in turn, recall, depends on the non-existence of zeros for solutions to 
the Jacobi equation. 


Exercise 7.7.3. Take the geodesic equations 


Ey 72 E, ray t Gy 42 
ult ypu + pul — sey 


7 E, 12 Gy tor Gy 42 
a =0 
v 2G" + G uv + G" 


=0 
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and multiply the first by —v’, the second by u’. Add the equations and compare with the formula 
of Theorem 5.1.5 to show that extremals for the arclength integral have zero geodesic curvature 
(and hence, are geodesics). 


Exercise 7.7.4. Consider the (arclength parametrized) Euler-Lagrange equations for the arclength 
integral above, 


d Ey 2 Gy 2 
—-(E , ae! Steen 
aa u) rs + 5 v 
d , Ey 72 G, 12 
—(Gv')= — pais 

ae v) rk + 5 v 


and let a(s) be an extremal (i.e., a solution curve for these equations). Show directly that the 
tangential component of a” is zero, Hints: write a’ =x,u’+x,v’ so that Eu’ =a’-x, and 
Gv’ =a’ -x,. Differentiate the right-hand sides of these equations and compare with the Euler- 
Lagrange equations. 


Now that we know that geodesics are extremals of the arclength integral, we can give a 
beautiful interpretation due to Jacobi of the equations of motion of a constrained conservative 
system. Recall that kinetic and potential energies have the form 


me 


= 5 (Ew +Gv*) and V=V(u,v), 


1 


2 


da 


at 


where @ is a particle trajectory. The action integral f T — V dt is independent of t, so the two 
variable first integral of Exercise 7.1.10 implies that energy H = T + V is aconstant along paths 
of motion. Specifically, 


,¢-V)_ oF —V) 
u v =C 
du av 


T-V- 


1 

5 (Eu! + Gt?) — V — u(Eu) — (Gd) =c 
1 
5 (EW + Gi?)—V — EW —- Gv’ =c 


l 
— 5 (Eu? + Gi") V=c 
T+V=>=-—c. 


Here and in the following we use H for energy to avoid confusion with the E of the 
metric. 

Now let’s take acurve w: | — M representing the motion of a particle determined by Hamil- 
ton’s principle on a constraint surface M. We take the metric of the surface to be E, F = OandG. 
Because we know that the path of the particle must conserve energy and because this condition 
is incompatible with the unit speed condition, we must explicitly reparametrize the curve to have 


7.7. Further Applications to Geometry and Mechanics 359 


constant energy. To do this, let Hp denote the constant energy along the path of the particle and 
define 


Vv Eu? + Gi? dt. 
=(\aaa ‘ a 


Lemma 7.7.5. The reparametrized curve a(t) has constant energy. 


Proof. First, note that the fundamental theorem of calculus implies 


dt 1 JER + Ge 
= = ——— V El’ + Gi? 
dt = /2(Hy — V) 
so that the chain rule then gives 
du du dt uf 2(Ao — V2(Ho — VY) 
dt dt dt ae JEW + GU 


dv dv dt du /2(Ay — V) 

dt dtdt —s dt JEW +Gy 
Using this, the following calculation then shows that this reparametrization conserves energy at 
the value Hp along the curve. 


1 du\? dv\? 
T(t) + V(t) = 5 | EC) () + G(r) (=) + V(r) 
a du\* 2(Hp — V) dv\* 2(Hy) — V) 
9 (« (F) Ew + Gv +6(2) Ee+on)}t” 


L (Ew+ Gt? 
Oy ae ao V 
A 5 (Satan) + 


=H—-V+V 
= A. 
O 


Furthermore, note that this calculation shows that T(t) = Hy — V. Also, the general energy 
equation T+ V =H gives 2T —H =2T —(T+V)=T — V. We shall use both of these 
relations in the following to determine the equations of motion of a particle as extremals of 
{ T(t) — V(t) dt. Note that, because of conservation of energy, these extremals must be found 
within the class of all curves with constant energy Ho. The first step is to observe that, from what 
we have said above, 


[ro —V(t)dt= fore — Hodt. 


Furthermore, since the derivative of a constant vanishes, adding a constant to an integrand has 
no effect on the Euler-Lagrange equations. Therefore, finding extremals for either side of this 


360 7. The Calculus of Variations and Geometry 


equation is equivalent to finding extremals for 
forma = [rer —V)dt 
d 
= [rt as Vyas 


= | 2(H) — V) ————— V Ew’ + Gi? at 
[20% nae tee 


= / V2(Ho — V)Eu? + 2(Ho — V)Gi? dt 
= [ Vie Ge dt 


where E = 2(Hy — V)EandG = 2(Ho — V)G defines anew metric on M conformal with respect 
to the the original E andG. (Here, we assume that Hp > V inthis region of the parameter domain.) 
It is now clear that finding an extremal for { T — V dt corresponds to finding an extremal for 
f VEw + Gv? dt. But, by Theorem 7.7.1, fy Eu? + Gv? dt has extremals which are geodesics 
on M with respect to the metric E = 2(H) — V)E and G = 2(Hy — V)G. Therefore, we have 
the following special F = 0 case of a result of Jacobi. 


Theorem 7.7.6. Leta denote the path of a particle with constant energy Ho under the influence 
ofa potential V constrained to lie ona surface M with metric E, F = O0andG. Then, for Hy) > V, 
a is a geodesic on M with respect to a conformal metric 


E=2(K)-V)E, F=0, G =2(H) — V)G. 


This theorem, while hardly the means to an explicit description of equations of motion, neverthe- 
less allows theoretical results about geodesics to be applied to mechanical systems. In particular, 
one of the most important questions about a mechanical system is whether or not there is a peri- 
odic orbit — that is, does some particle, say, return to its initial position after a certain amount of 
time. By the theorem above, this question is equivalent to asking whether a surface M (with any 
metric) has a closed geodesic — a geodesic a: [a, b] — M with a(a) = a(b). In 1951, Lusternik 
and Fet [LF51] (also see [Kli78]) proved a general theorem which, in particular, says that any 
compact (i.e., closed and bounded) surface does indeed have a closed geodesic. Applying this 
result to a surface in conjunction with Jacobi’s theorem gives 


Corollary 7.7.7. IfM is a compact surface and V(u, v) is a potential function with Hy > V(u, v) 
for all (u, v) in the parameter domain of M, then there exists a periodic solution to the equations 
of motion of a particle constrained to move on the surface under the influence of V. 


Example 7.7.8 (The Two Body Problem). 

The following comes from [Pin75]. Suppose that two bodies are mutually attracted with potential 
V depending only on the distance between the bodies. Mechanics tells us that the problem may 
be reduced to the case of single body planar motion in a central force field. The Jacobi metric for 
the motion along a constant energy curve has conformal scaling factor f = 1/,/2(Hp — V) so 
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that Exercise 5.4.2 gives the Gauss curvature, 


_1PVaetv , VitV2 
41 (Ho - VP (Ho - VP 


Exercise 7.7.9. Use Exercise 5.4.2 to verify the formula for Gauss curvature above. This curvature 
is called the mechanical curvature. Further, show that if V has a local minimum at (uo, v9), 
then K(u, v) > 0 for all (u, v) sufficiently close to (uy, vp). Similarly, K(u, v) < 0 near a local 
maximum. 


Exercise 7.7.10. Let V = —1//u? + v? be the Newtonian potential in a plane. Show that 


1 1 
Vey Ss and Via + Vov = -——— ee - 
i Y  (u? + v2? (u2 + v2)? 


By Exercise 7.7.10, the Newtonian potential V = —1//u2 + v2 gives 


7 Ho 7 No 
AAV a2 + v2 +:1))3 4( Hor + 1))° 


where r = Ju? + v? is the radial distance from the origin. Now, the constant energy equation 
(1/2)v? — 1/r = Ho gives v? = 2(Hor + 1)/r > 0. Hence, the denominator of K is positive. 
Therefore, 


H>O0< kK <0 
HW =00 K=0 
Ho <0e¢ K >0. 


For the two body problem, orbits are characterized in terms of energy (see [PC93]). Putting this 
together with the above gives 


K<0 & Hy>0 # _ the orbit is hyperbolic 
K=0 & H=0 #& _ the orbit is parabolic 
K>Q © H<0 # | the orbit is elliptical. 


Therefore, orbits are characterized in terms of their Gauss curvatures. In particular, a point where 
K > 0 guarantees a periodic orbit. 


Before we end our discussion of Jacobi’s theorem, we should point out explicitly that everything 
we have said holds true in higher dimensions as well. Also, it should be noted that one of the 
main features of Einstein’s geometrization of gravity is his assertion that particles, planets and 
light photons, say, travel along paths which are geodesics in a region of spacetime whose metric 
is determined by the mass-energy in that region. Considerable philosophical hindsight therefore 
allows us to view Einstein’s approach as a natural descendent of Jacobi’s theorem. 

Now let’s apply Hamilton’s Principle to some specific down-to-earth physical problems. 


Example 7.7.11 (A Pendulum). 
Suppose a pendulum of length @ is pulled away from its stable equilibrium (i.e., hanging straight 
down) and allowed to swing freely. What are the equations of motion? The coordinate we shall 
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use is the angle made by the pendulum and the vertical axis, denoted by y. The potential energy 
of the bob of mass M is then given by 


V = Mgh = Megf(1 —cos(y)) 
where g = 9.8 m/s? and h is the height above the point taken to have potential energy zero (i.e., 
the stable equilibrium). The kinetic energy is given by 


| 1 
T=-Mv==-Me’y? 
ei aes 


since the linear speed v equals the angular speed y times the radius 2. Then we have 
1 
L=T-V= sue y> — Mg&(1 — cos(y)) 


and the Euler-Lagrange equation is 


pa ok 4 (ab 
~ ay dt \ay 


d 
= —Megé sin(y) — a (Mey) 
= —Mglsin(y) — Me*¥ 
=yt+ ; sin(y). 


This then shows that the Euler-Lagrange equation in this case reduces to the usual equation of 
motion of the pendulum. 


Example 7.7.12 (A Spring-Pendulum Combination). 

Suppose a mass m is attached to a spring with spring constant k and is constrained to move 
horizontally. To the mass is attached a pendulum of length £ and bob mass M. We shall use the 
coordinates x and y to describe the motion where x is the horizontal distance of the mass m from 
its stable spring equilibrium and y (as in Example 7.7.11) is the angle made by the pendulum 
with the vertical. (Actually we have chosen to use these variables to conform with Maple work 
done later). Recall that Hooke’s Law says that the restoring force for a spring is F = —kx with 
potential energy V = kx?/2. Therefore, concentrating on the motion of the bob mass M, we have 
a potential energy of 


- kx? 


V= + Mgf(1 —cos(y)). 


The determination of the kinetic energy is a little harder. Let r and s denote the horizontal and 
vertical distances respectively of the mass M from equilibrium. Then we have 


r=x+ésin(y) and s =é-— €écos(y). 


Then the speed of M is given by 


va vr?+5* = JS (x + Lcos(y) y)? + 2 sint(y) y? 
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so that 


1 1 
T = 5m + 5M 
| 2 1 2 i 1 242 
= gint + 3M + Mécos(y) xy + gMe y. 


The Lagrangian for the system is then 


L= sm + M)x? + Mécos(y) xy + sme y> me — Mgt(1 — cos(y)) 
with Euler-Lagrange equations (from Exercise 7.1.7) 
—kx — * ((m + M)x + ME€cos(y) ») = 0 
(m+ M)k — ME sin(y) »* + MEcos(y) 3} = —kx 


and 
d 
—MEsin(y) xy — Mgésin(y) — a (ME cos(y)% + Me’) =0 


Me? + Mécos(y)% + Mgésin(y) = 0 
£¥ + cos(y) ¥ + g sin(y) = 0. 


These are then the equations of motion of the spring-pendulum system. These equations are not 
solvable by hand, but Maple may be used to numerically solve them and plot the motion of M. 
See Subsection 7.10.3 and Exercise 7.10.7. 


Exercise 7.7.13 (The Double Pendulum). Suppose two pendula are attached, one fixed on a 
surface and the other fixed to the bob of the first. Let the first bob have mass m and the second 
mass M. Also, assume the pendula are of the same length £. Finally, denote the angles made by 
the first and second pendula with the vertical by x and y respectively. Show that the potential and 
kinetic energies are given by 


V = mel + mgf(1 — cos(x)) + Mgé(2 — cos(x) — cos(y)) 
1 1 
T= meri? ms sme? + y*) + Mex cos(x — y). 


Here we take the point of potential zero to be below the surface from which the double pendulum 
hangs a distance of 2£. Also note that the unit vectors denoting motion around the respective circles 
are X = (cos(x), sin(x)) and y = (cos(y), sin(y)) with the velocity of M given by €(xxX + yJ). 
The kinetic energy then uses the dot product of this quantity with itself. Once the kinetic and 
potential energy have been determined, find the Euler-Lagrange equations for the action integral 
and use Maple to plot the path of the mass M under different initial conditions. 


Example 7.7.14 (The Taut String). 
Suppose a taut string is fixed at both ends and plucked. What are the vibrational modes of the 
string? Think of the height of the plucked string as being the graph of a function u(t, x). Then the 
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kinetic energy is given by T = 1/2 ie pu,(t, x)’ dx at time t where L is the distance between 
the fixed ends of the string and p = p(t, x) is the local linear mass density of the string. The 
maximum amount the string can be plucked is usually small, so we assume the work required to 
bend the string is negligible. The only potential energy comes from elongating the string itself 
past length L where the string has a tensile force t = t(t, x). Here we neglect gravity’s effects 


also. We have 
L 
v= | (isu -1) dx 


at time ft. The action integral for this system is then 


T Ly] 
sf [ suttay—r( f+ 1) dxdt 


There are boundary conditions of course: u(t, 0) = 0, u(t, L) = Oandu(0, x) = uo(x), u,(0, x) = 
v(x) prescribed by the initial pluck. The integrand has the form 


F(t, x, u(t, x), u(t, x), ur(t, x) 


so we must use the two-independent variable Euler-Lagrange formula in Exercise 7.1.8. We then 
get 


d d —Tu, 
(22 (a= — =0 
at (pu;) ay ( Ta rs | 
d d 
ay (Pu) + aye tee) =0 


(Pur): = (Tux)x 
Puy = Ty, 


2 
Uy =O Uyy, 


where, in the second line we used the assumption that |u,| << 1 to eliminate the square root 
(although this is mathematically shaky) and in the last line we assumed that p and t were constant. 
(In fact, even leaving the assumption |u,| << | until after taking the derivative, we wind up 
with the same equation.) The equation u,,; = 07u,, is called the wave equation in one dimension. 
To solve this equation, we try separation of variables. That is, we assume that the solution has 
the form u(t, x) = o(t) f(x) with uj, = A(t) f(x) and uz, = b(t) f(x). The wave equation then 
becomes 


b(t) f(x) = 07 P(t) F(x) 
1¢_ f'@) 
ob f(x) 
The left side is a function of ¢ alone and the right side is a function of x alone, yet the two sides 
are always equal. This can only happen if they are equal to the same constant —A (with A > 0 for 
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physical reasons). We then have the two decoupled equations 


6+i07o =0 
f"+af =0 


with general solutions 


b(t) = Acos(oVAt) + B sin(o Vat) 
f(x) = Ccos(VAx) + D sin(VAx). 
The initial conditions on the string imply C = 0 and, since D sin(/AL) = Oand f #0, VAL = 
nz for an integer n. Hence, A = (nz/L)’. The final solution is then given by 
Co 


wtar)= Do (Asin ("F*) cos (74) + ausin(“Z*) sin (“5°*)) 


n=) 


with uo(x) = °°, An sin (*) and v(x) = SO, B, “F2 sin (*). The fundamental frequen- 
cies of vibration are then 


For low energy, then, we should see a square root dependence of frequency on t and p and 
an inverse linear dependence on L. That is, increase L and linearly the fundamental frequency 
decreases (as any violin player knows). 


To end this section, we want to exhibit yet another of our earlier examples as a constrained 
variational problem. In Section 3.6, we considered the surfaces of revolution with constant mean 
curvature — the so-called surfaces of Delaunay. Recall that these surfaces were characterized by 
a certain differential equation, 


2ah 


Vi+h? 


where a and b are constants. Here, h = h(u) denotes the profile curve (or meridian in the plane) 
of the surface of revolution. Let’s consider the following problem. For surfaces of revolution, fix 


avolume YV =a fhuy du and minimize the surface area S = 27 f h(u),/1 + h'(uy du. What 
are the resulting surfaces? The usual variational setup for such a constrained problem (neglecting 
x in the formula) gives 


Minimize J = i‘ 2h(u)y/ 1 + h’(uy — AA(uy* du. 


Theorem 7.7.15. The extremals for the variational problem above are the surfaces of Delaunay. 


Wt = +b? 
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Proof. Since the integrand does not depend on the independent variable u, we may use the first 
integral f — x(0f/0x) = c in place of the Euler-Lagrange equation. 


2hh’ 
2*hV1 +h? —ah? —h’ | —— J =e 
Vi+h? 


2h(1 +h”) —An2V1 +h? — 2h! =ceV1 +h? 
2h =(c+Ah*)V1 +h? 


where a = —1/A and b = —c/X. Of course this is precisely the equation above characterizing 
surfaces of revolution of constant mean curvature. O 


As we mentioned in Chapter 3, many one-celled creatures exhibit rotational symmetry and 
shapes which seem close to surfaces of Delaunay. D’ Arcy Wentworth Thompson asked whether 
the minimization of surface area under the constraint of fixed volume together with a certain 
biological tendency toward symmetry might be responsible for the observed shapes. Of course 
we have seen in Chapter 4 that compact surfaces of constant mean curvature are spheres, so 
Thompson needed other biological considerations to allow for the physical existence of rounded- 
off “compactified” versions of surfaces of Delaunay. More information on this approach to 
cellular morphology may be found (of course) in [Tho92] as well as in [HT85]. 


7.8 The Pontryagin Maximum Principle 


The subject of optimal control theory provides a slightly different perspective on variational 
questions from the calculus of variations. We will only give a brief outline of one of the main 
results in optimal control — the Pontryagin maximum principle — and how this result may be 
applied in geometry. The true power of the Pontryagin maximum principle is seen in its application 
to discontinuous problems (i.e., the so-called bang-bang controls) such as those found in rocket 
motor control or trajectory control, but we shall only consider simple geometric applications 
here. For an elementary introduction to the principle’s non-geometric applications, see [Pin93]. 
Also, we shall restrict ourselves to two-dimensional problems as we have done so far. 

The main problem in optimal control is to find a path (or trajectory) x = (x;(t), x2(t)) which 
satisfies the system of differential equations 


xy = SiG, u), x2 = als, u) 


(where u = (u(t), u2(t)) is the control) and which minimizes a given cost function 


J= ‘a Fo(x, u) dt. 
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Note that we have reverted to the control theorist’s subscripts in place of the geometer’s su- 
perscripts for coordinate functions. This should ease the transition to further study in optimal 
control. Also, a typical optimal control problem may well fix the initial and final states of the 
system, x(to) = Xo and x(t,) = X;, as well as the initial time f. The final time t, is typically 
undetermined. For example, if a spaceship is to make a landing on the moon in the shortest time, 
it must satisfy a system of differential equations derived from Newton’s Law F = ma together 
with the thrust control of its rockets. Because time is to be minimized, and starting from fo) = 0, 
the cost is J = i 1 dt = fy, so the final time f, is not fixed initially, but is itself determined by 
solving the problem. See [Kir70, p. 247] for a discussion of this problem. 

Now, how would a variationalist approach this problem? We have already seen that various 
types of constraints may be placed on variational problems, so it seems reasonable to simply 
incorporate the system of differential equations into the cost integral as a constraint (see Subsec- 
tion 7.6.3). Because x;(t) = f;(x, u), it is certainly true that 


| Wilt(Xi(t) — fix, u)) dt = 0 


for time dependent Lagrange multipliers y;. Therefore, in order to solve the optimal control 
problem above, instead solve the constrained variational problem 


Minimize J = | | fo + Wilei(t) — fil, u)) + Poea(t) — fax, w)) dr. 


To solve this problem, we will write down the usual first integral for t-independent integrands, the 
Euler-Lagrange equations (for four functions x;(t), x2(f), u;(t), #2(t)) and then take into account 
the endpoint conditions, f; undetermined and x(t,) fixed. 

Because the integrand of J does not involve t explicitly, we may write 


fo + wilai(t) — fix, u)) + We2(%2(t) — a(x, u)) — 41) — t2¥2 = 
or, rather 
fo-Wh-hh=e. (1) 


The Euler-Lagrange equations with respect to x;, x2, u, and u2 are 


Ofo afi afr 


Wha 5 ig 5 (2) 
hte -wll wld 0 
o= Py, A yo (4) 
0 = ey ty, (5) 


Exercise 7.8.1. Verify the Euler-Lagrange equations above. 
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In Remark 7.1.18 (also see Theorem 7.2.4 and the following examples), we mentioned that 
the general condition for extremizing J = [ f dt subject to the terminal endpoint at t; lying on 
a given curve a(t) is 


a 
f(t) + (&(t1) ~ x(t41)) F(t) = 0. 
In our case a(t) = x(t), So & = 0 and we have 


0 
flty) — #(h) ein) = 


We must use a multivariable version of this condition of course. Noting that the integrand of 
J does not depend on uw; or w2 and using F = fo + ¥i(xi(t) — fi(x, w)) + Yoa(ta(t) — fax, u)), 
we obtain an endpoint condition 


; OF : OF 
F(t)) — X1(t1) (hh) — Xa(th) (hn) = 0 
Oxy 0X2 
which is then written out as 
folt) — WDA) — Yet) falts) = 9. (6) 
Now, for convenience, define the Hamiltonian associated to the problem to be 
H=-fpthifitmhh 
and note that equations (1) — (6) imply the following relationships. 
(1) A(t, x, u) = 0, by (1) and (6). 
: 0H 4 
(2) Wi = —z— and fo = -=—. 
ax, 2 
(3) 0H /du, = 0 and 0H /du2 = 0 along an optimal trajectory. 


This heuristic discussion may be strengthened considerably to give the following necessary 
conditions for optimality. 


Theorem 7.8.2. (The Pontryagin Maximum Principle) Suppose u(t) is a control which transfers 
the system with 


(State Equations) x = fi(x, u), X2 = f2(x, u) 


from fixed state Xo to fixed state x, along a path x(t). If u(t) and x(t) minimize the cost 


J= a Jo(x, u)dt, 


fo 


then there exist yy, and yf so that the Hamiltonian H = — fo + W fi + Wo fo obeys: 


(1) Hn, W2, X1, %2, U1, U2) = 0 along the optimal trajectory. 


(2) (Co-state Equations) yy = ——— and Wh = —-—. 
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(3) For each t, H attains its maximum with respect to u at u(t). In particular, 
aH aH 
=) = 


—= —=0. 
uy au2 


and 


Example 7.8.3 (Geodesics). 
Suppose we want to find geodesics on a surface M: x(u, v) with orthogonal metric E and G. We 
can formulate the problem by saying that, from a starting point x9, we wish to control our journey 
to minimize arclength. Because we can write a curve on M in the form a(t) = x(u(t), v(t)), we 
see that we are really interested in controlling the functions u(t) and v(t). Therefore, the optimal 
control problem becomes, find a curve (u(t), v(t)) so that 

“=u, v= U2 


and the integral 


j= | JE® + Gia: = f Jeni + odar 


is minimized. The Hamiltonian associated to the problem is 


H = —,/ Eu? + Gud + wus + your 


dH E,w+G,u3 ; dH Eyui + Gyus 


Se Be ae x at ee 
4 2,/ Eu? + Gu} 0 2,/ Eu? + Gu2 
and u-critical point equations for H, 


oH Eu Gu oH 
ee eS SO ay 


a aed 
| Eut + Gu3 yj Eut + Gus Me 


These last equations say that 


w, = Euy;/ Eu? +Gu3 and Yo = Gu2// Eut + Gud. 


Replacing y; and yy by these quantities and u, and u2 by uw and v in the co-state equations gives 
<( Eu )= E,W +G,i 

dt \ JEW+Gv? 2VE ue +Gi? 

al Gov )= E, uw? + G, v? 

dt\ JEW+Gi 


with co-state equations 


WER +GR 
These are, of course, the “geodesic equations” arising in the proof of Theorem 7.7.1. Hence, the 
optimal trajectories for this problem are geodesics. 


Remark 7.8.4. With respect to the metric on M, the amount of effort expended to move 
from one point to another along an optimal path (i.e., a geodesic) is precisely f lulz ¢ dt= 


f  Euj + Gui dt = J. Therefore, geodesics may be considered to be paths along which a 
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minimum amount of control is needed. Compare this with fuel optimal control problems in 
(Pin93] and [Kir70]. Also, note that we have directly obtained geodesics as locally determined 
minimizers of arclength here. The controls u; and u2 (and thus the cost J = f ,/ Eu? + Gu} dt) 
are Jocal in the sense that we recreate the correct u(t) and u(t) from the state equations and cost, 
both of which rely on local control information. In a way, this is very different, philosophically, 
from trying to find a minimizer among all curves joining given endpoints — the underlying 
concept of the calculus of variations. 


Exercise 7.8.5. Show that an arclength minimizer between (1, 2) and (2, 3) in the plane is the 


straight line y = x + 1 by forming the optimal control problem 


x= uy, y=u2 


Minimize J = | Jur tub dt 


and solving by the Pontryagin maximum principle. 


Exercise 7.8.6. Show that a least area surface of revolution is a catenoid by forming the one- 
dimensional optimal control problem 


y =u, Minimize J = fy Jiu dx 


and solving by the Pontryagin maximum principle. Hint: derive the differential equation y y” — 
y? — 1 =Oand let y’ = w so that y” = w (dw/dy). 


Exercise 7.8.7 (Newton’s Aerodynamical Problem). Newton asked what the optimal shape was 
for a surface of revolution passing through, what he called, a rare medium. Here, the word optimal 
is taken to mean “offers the least resistance to movement through the medium”. Also, it should 
be noted that usual media such as water or the lower atmosphere do not qualify as rare while the 
stratosphere does. Newton’s problem may be cast in the language of optimal control (see [Tik90], 
[AFT87, p. 15] for a derivation). Solve 


y =u, u>0; y(0) = 0, y(a) =}, 


inf paca 
0 14+u2 


where y = y(x) represents the profile curve of the surface of revolution. Solve Newton’s problem 
by determining y using the Pontryagin maximum principle. Hint: after writing down the conditions 
of the principle, consider the minimum of the function 


to minimize cost 


Ww 


rege 


¢ 


What is the minimum for various w’s? Maple may be useful. Note u > 0. Show u = 0 is a local 
minimum at least. How do minima come about? After x is determined parametrically in terms of 
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u, find y, using 


dy _dydx __,dx 

du dxdu > du’ 
Remember to find the constant of integration. The answer to the problem may be found in 
Exercise 1.1.6. 


Since optimal control theory arose in the 1950s in response to the problems of rocket and 
satellite trajectory control, it has revitalized the calculus of variations. We have presented only 
the barest hint of the power and beauty of this subject, but surely, within it hide many more 
applications to the geometry of surfaces. 


7.9 An Application to the Shape of a Balloon 


A Mylar balloon is constructed by taking two circular disks of Mylar, sewing them together 
along their boundaries and then inflating the resulting object with either air or helium. Somewhat 
surprisingly, these balloons are not spherical as one might expect from the fact that the sphere 
possesses the maximal volume for a given surface area. This experimental fact suggests the 
following mathematical problem: given a circular Mylar balloon of deflated radius a,” what will 
the shape of the balloon be when it is fully inflated (i.e., inflated to maximize volume subject to the 
constraint (7.9.1) that Mylar doesn’t stretch)? This question was first raised by W. Paulsen (see 
[Pau94]) who succeeded in determining the radius r, thickness t (i.e., the distance between the 
north and south poles; see Figure 7.5) and volume of the inflated balloon. Paulsen’s answers were 
expressed in terms of the gamma function. Here we want to analyze the balloon in terms of elliptic 
functions. The advantage of this approach is that we can determine an explicit parametrization 


North Pole 0.15 


0.1 


South Pole -0.15 


Figure 7.5. Profile curve for the Mylar balloon. 


Note that choosing a as the deflated radius and r as the inflated radius is the reverse of the notation in [Pau94]. 
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for the balloon and use this to calculate Gauss curvature among other invariants. We note that 
[Opr00] provides a numerical approach to this problem using Maple. The approach of this section 
is modelled after that of [MO03a]. A Maple approach is given in Subsection 7.10.5. (For a nice 
exposition of models for scientific balloons, see [BW04].) 

We start with the mathematical model of the balloon following Paulsen’s original argument. 
When the Mylar disk is inflated, the radius deforms to a curve z = z(x) that we take to be in the 
first quadrant of the xz-plane. Of course, physical intuition implies that the curve proceeds from 
its highest point on the z-axis downward to a point of intersection with the x-axis. This is the 
right-hand side of the curve that, when revolved about the z-axis, produces the top half of the 
balloon. The bottom half is just a reflection of the upper half in the xy-plane. 

Let r be the radius of the inflated balloon (i.e., the distance along the x-axis from x = 0 to 
the point of intersection with the x-axis). Because of its physical properties, the Mylar does not 
stretch significantly. As a result, the arclength of the graph of z(x) from x = 0 to x =r is equal 
to the initial radius a. That is, we have 


[vi eT eee (7.9.1) 
0 


The basic shape of the balloon is determined by this constraint. Next, it is clear that, when the 
balloon is inflated, the pressure of the gas inside induces a maximal enclosed volume. Because 
the balloon is a surface of revolution, this volume is given by the shell method: 


V=4n [ xoas. (7.9.2) 
0 


In this setting we now have the variational problem of extremizing V subject to the constraint 
(7.9.1). The constrained Lagrangian is then given by f(x, z, z’) = 4a xz(x) + AVJ/1 4 2/(x)* and 
the corresponding Euler-Lagrange equation is 


d Az'(x) 
— | ———— | -— 47x = 0. 7.9.3 
dx ( os) ( ) 
This equation is easy to integrate, yielding 


Az’(x) 
V1 z/(x)* 
for some constant of integration C. This constant can be found by taking into account the obvious 


geometrical (transversality) condition z'(0) = 0. Inserting x = 0 into (7.9.4) and using z’(0) = 0 
gives C = 0. Consequently, we obtain 


=2nx7?+C (7.9.4) 


/ 
2 
Ae) ie (7.9.5) 
Yl4tzi(x? A 
The transversality condition lim,_,,- z'(x) = —oo, along with the requirement that the curve 


proceed from its highest point to its intersection with the x-axis without introducing critical 
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points (i.e., z’(x) < 0 when 0 < x <r), allows us to rewrite 4/(2sr) in the form —m*. Now, 
solving (7.9.5) for z'(x), we find that 


x2 


2(x) = -= 


m4 — x4 


and, integrating, that 


r 12 
2(x) = / —— df, 
) x v¥m4s—14 
where the choice of the upper limit of integration comes from the requirement that z(r) = 0. The 
profile curve of the Mylar balloon will be completely determined if we know what m is. Its value 
can be ascertained if we remember that lim,_,,- z’(x) = —oo and note that this will be the case 
if we take m = r. Therefore, in the end we have 


z(x) = (Q<x <r). (7.9.6) 


r t2 
| ae 
x vr4 —t4 


This is an elliptic integral that Paulsen ([Pau94, p. 955}) observes has no closed form solution in 
terms of “elementary” functions. In what follows, however, we find an explicit expression for the 
integral (7.9.6) in terms of elliptic functions and then use this expression to derive various facts 
associated with the geometry of the Mylar balloon. For that purpose we change the integration 
variable from f to u and make the substitution 


t=rcn(u, k), (7.9.7) 


where cn(u, k) is the Jacobi cosine function. The basic properties of elliptic functions enunciated 
in (Section 3.7) and the substitution (7.9.7) reduce (7.9.6) to 


Giy= ci f en’(u, k) dn(u, k) du 
250 fy smu, ky 


Here uo satisfies cn(up, k) = x/r. Also, the identity 


(7.9.8) 


1 —cn*(u, k) = (1 — en?(u, (1 + en?(u, k)) = sn?(u, k\(2 — sn?(u, k)) 


may be helpful in arriving at (7.9.8). Further simplification of (7.9.8) can be achieved by choosing 
the value of the elliptic modulus k appropriately. Letting k = 1//2, we note that the identity 
dn?(u, k) +k? sn’(u, k) = 1 gives 


2(u) = fo (a =) dit. (7.9.9) 


Using Proposition 3.7.5, we can then express z(u) in terms of elliptic functions: 


z(u) =rv2 [e(sa(u, =) =) zs 5F(sn(u, =) =)| (7.9.10) 
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In light of (7.9.7), this means that the profile curve (traced counterclockwise) is given by 
(u) n( ) 
x(u) = ren{u,—~)}), 
J2 


wy = sled) oh) Selle 3) 8] 


for u in [(0, K(1/./2)]. Note that the complete integral K(1/./2) arises because en(u, 1/+/2) 
varies from | (where u = 0) to 0 (where u = K(1/¥/2)). 


(7.9.11) 


Theorem 7.9.1. The surface of revolution S that models the Mylar balloon is parametr- 
ized by 


x= x[u, uv] =(x(u, v0), y(u.v), clu, v)), 


where for u in [—K(1/V2), K(1//2)] and v in [0,277], 


| : 
x(24,0) = ren(u, =e) cos uv, yu v) = ren(u =) sinv, 
(7.9.12) 


a(u,v) = rv2 [e(sn( =) , =) — 5F(sn(u =) : =)| , 


We can now put this parametrization into a computer algebra system such as Maple and plot. 
We then see the familiar shape of a Mylar balloon in Figure 7.6. 

Having the explicit parametrizations of the profile curve (7.9.11) and the surface of the Mylar 
balloon (7.9.12), we now turn to the study of their geometries, Of principal importance is the 
relation between the respective radii of the deflated and inflated balloons. From (7.9.1) and 


Figure 7.6. Two views of the Mylar balloon 


7.9. An Application to the Shape of a Balloon 375 


(7.9.11) we obtain the arclength (where we shorten sn(u, 1/./2) to snu, K(1//2) to K etc.): 


K 
i, Vx'(u)? + z'(u)? du 
0 


2 
[ r? sn2u dn?u + re jontu du 


f ee — smu ) + 5 —sn2u)? du 


r Nee ak Se 


0 Fi V2 


We end up with the relation 


_ Ka/v2), 
ie Se 


We can compute the constants with Maple to see that the numerical relations between a and r 
are: 


(7.9.13) 


a¥13110r, r % 0.7627 a. (7.9.14) 


The thickness t of the balloon, as measured from north pole to south pole, is 2 - z(K (1/2), 
with z given by (7.9.10). That is, 


t = 22(K(1/V2) = 2V2 [Easy = sKaiv9) r. (7.9.15) 


Again, numerical calculations can be done with the aid of a computer algebra system. They reveal 
that cr © 1.1981 r + 0.91394. 

In order to compute the volume of the balloon, we notice first that, by substituting expression 
(7.9.6) for z(x) in (7.9.2), switching the order of integration, and integrating with respect to x, 
the volume integral can be put into the form 


r 4 
ve 2x | a gh (7.9.16) 
0 


rt — 74 


Next, after the same change of variable (7.9.7) as before, this integral becomes (where, as usual, 


K = K(1/V2)) 


K 
2r° i an*(u, =) du. (7.9.17) 
0 


It can be verified either numerically or by taking m = 0 in the reduction formula (see [Gre92]) 


1 
four du = (m+ Dd —) E en™* ly snu dnu 


+  (m+2)(1 — 2k?) | en™**y du + (m + 3)k? | en™*4y au 
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that fo" en*(u, 1/-/2)du = K(1//2)/3. Therefore, we obtain 


DN 
ves (=) °. (7.9.18) 


Numerical calculation in both cases gives V © 2.7458 r? © 1.2185 a°. 

Of course, we wish to show that elliptic functions can provide more information about an 
object than numerical calculations alone. We have seen this a bit already from an analytic point 
of view, but now we will see it in full force geometrically. Namely, we shall derive the differential 
geometric characteristics of the balloon in terms of elliptic functions and then apply these results 
to study its curvatures. 

From the relation dn?(u, k) + k?sn?(u, k) = 1, the formulas in (3.7.3), and the choice k = 
1/./2, we calculate the coefficients of the first and second fundamental forms for the surface S 
of Theorem 7.9.1: 


1 ] 
l=rcnl(u,—), m=O, n=ren (w=). 
( a af 2 


While the metric coefficients are computed easily by hand, a computer algebra system is very 
helpful for calculating /, m, and n. 

Our first application of these calculations gives us something that is quite surprising. We saw 
that the formula for the volume of the Mylar balloon involves the complete elliptic integral of the 
first kind, so we might expect that a formula for surface area would be equally as complicated. 
To the contrary, we have: 


Theorem 7.9.2. The surface area of the Mylar balloon S of inflated radius r is given by A(S) = 
ger’, 


Proof. The surface area element d A(S) is given by 


2 
dA(S) = VEG — F' dudu = VEG dudv = re aude, 


Now it is quite easy to find the total surface area A(S) of the Mylar balloon S by computing the 
following integral (where, once more, K = K(1/ J2)): 


jue 


2x K 


i) en( =) dudv 


A(S) 


r 


v2 


Kl 
N N 
oe 
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y2 fen(u, 1/v2) dn(u, 1/v2) 

= 4x — | ————_—_.—_——-d 
al dn(u, 1/V2) " 
r? ! dw 

= 4n =e ear (w = sn(u, 1/V2)) 

1 

— an V3 arcsin (=) ; 

— ag tN? 

~ ye 

=> mr?, 


O 


We next focus on qualities of the balloon that are central to its shape. We easily obtain the 
curvatures for the balloon from the coefficients of the first and second fundamental forms and the 
usual formulas for K and H from Section 3.2. 


ren(u, 1/2) ren? (u, 1/2) - 2en(u, 1/v2) 


r2/2- Pen?(u, 1/2) r 


+ 


2 (r2/2-r2en®(u, 1/v2)) 2r 


These formulas actually allow us to verify our intuition about one particular aspect of the 
balloon’s geometry. When we look at the balloon, we “see” the north and south poles as being 
“flat,” but it is difficult to make this precise. (Although we have seen that the profile curve of the 
balloon meets the z-axis perpendicularly, this is not sufficient to ensure “flatness,” as the example 
of the sphere shows.) Now, however, we can establish a geometric result telling us that the poles 
are very flat indeed. 


Theorem 7.9.3. The north and south poles of the Mylar balloon are planar points {i.e., points at 
which normal curvatures are zero in all tangent directions). 


Proof. The north pole of the balloon corresponds to u = K(1//2), and we know that 
en(K(1//2), 1//2) = 0. Therefore, we see from the formulas for K and H that both the 
Gauss curvature K and the mean curvature H are zero at this point. Hence, by Exercise 3.1.7, 
we have k,; = 0 and kz = 0. Since these are the maximum and minimum normal curvatures, we 
see that all normal curvatures at the north pole are zero. The same is true for the south pole by 


symmetry. O 
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The Gauss and mean curvatures satisfy K = (8/9)H*. From Exercise 3.1.7 , we also see 
that the principal curvatures (which are the principal curvatures for a surface of revolution; see 
Exercise 3.3.5) satisfy 


2cn(u, | 2 
page 


= 2k, =2ky (7.9.19) 
for all u. Either of these relationships identify the Mylar balloon as a very special type of 
Weingarten surface (i.e., a surface whose principal curvatures satisfy a functional relation). 
Surprisingly, this relation between principal curvatures actually characterizes the balloon. 


Theorem 7.9.4. A surface of revolution M parametrized by 
x(u, v) = (h(u) cosv, h(u)sinv, g(u)) 


such that k,, = 2kq has a parametrization of the form (7.9.12). That is, the surface M is a Mylar 
balloon. 


Proof. By re-parametrizing, we may without loss of generality assume that the profile curve 
(A(x), g(u)) of M has constant speed. (Note that the parametrization of the Mylar balloon that we 
have given satisfies this without re-parametrization: ,/x’(u)? + z’/(u)? = rK/V2; see (7.9.13).) 
We thus have g’(u)* + h’(u)* = A* for some constant A > 0. By differentiating this relation with 
respect to u, we find that g’g” + h’h” = 0. In tandem, these two relations reduce the principal 
curvature formulas of Exercise 3.3.5 to 


g g 
k=, k,; = —. 
e AR "~~ Ah 
The hypothesis k,, = 2k, then gives 
wn h’ 
gh ak 
8’ h 


which ensures that g’= ah? for some constant a. We now insert this into the constant 
speed relation to obtain dh/du = h’ = +/ A* — ah‘. This is a separable differential equa- 
tion that can be solved by making the substitutions h = ./A/a cnw, dh = —./A/a snw dnw dw 
(where the elliptic modulus is always taken to be 1/./2) and using the identity 1 ~ cntw = 
(sn?w)(2 — sn*w): 


dh 
+A eee rarer: 
ee es - 
tates f V2 sn(w) oo, iz | = 


{1 - Esn(w? 
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whence /2a Au +c = w, where ¢ is a constant. We then apply the elliptic cosine and use the 
substitution relation to obtain 


[Sa = en(V2a Au +2), 
hii = [4 ca(Viarau +2). 


We take ¢ = 0 for convenience and find g() by recalling that g’ = wh* = Acn*(/2a Au) and 
invoking Proposition 3.7.5: 


| 


gu) A [o? (viwAu) du 


ca (=) 
JF (e(w)-b (oon). 


This results in a parametrization of type (7.9.12). (Note that, for the balloon in (7.9.12), A = 
rK//2 anda = K/(V2r).) O 


It is possible to place Theorem 7.9.4 into a broader context: we know that there are certain 
other situations where imposing a condition on principal curvatures characterizes a surface M. 


© If we require that k; = ky at every point of a compact surface M, then M is a sphere by 
Theorem 3.5.2. 


© If we insist that M be a surface of revolution for which k, = —k, then H = (k, + k,)/2=0 
and M is a minimal surface of revolution. By Theorem 3.5.7, M is a catenoid. 


e If we require M to be a surface of revolution and k; = —k2 +c, where c is a nonzero 
constant, then M is a surface of Delaunay by Theorem 3.6.1. 


Every soap film is a physical model of a minimal surface (but not conversely) and soap films 
arise from the variational principle of minimizing “energy” in the form of surface area (see 
Chapter 4). Soap films also obey the Laplace-Young equation of Section 4.1 with proportionality 
constant the surface tension of the film. Of course, a film has the same pressure on both sides, so 
the mean curvature is zero and k, = —k. Natural questions that arise are: 

1. Is there a type of Laplace-Young equation for the Mylar balloon? In other words, are there 
physical entities akin to surface tension and pressure that provide a non-variational characteriza- 
tion of the balloon as in Theorem 7.9.4? 

2. Are there characterizations of surfaces satisfying restrictions k; = ck, for c 4 —1, 1,2? 
Are there any physical processes that produce these restrictions? 

Here is an exercise that is in the long tradition of proving analytic results geometrically. 


Exercise 7.9.5. The total Gauss curvature for the Mylar balloon is given by the Gauss-Bonnet 
theorem: [. 5 K dA = 2m x(S) = 42, where x(S) = 2 since the balloon is homeomorphic to 
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a sphere and the Euler characteristic of a sphere is equal to 2. Now use the formula K = 
2cn? (u, 1/ A) /r? and the area element for S (see Theorem 7.9.2) to compute f, K dA directly 
to discover geometrically the analytic relation 


K(1/¥2) 5 1 1 
cm | u,— |] du = —=. 
f(a) uz 


For more about the Mylar balloon, see [MO03a]. In particular, closed geodesics are found on 
the balloon in a manner akin to the way they were found on the unduloid in Subsection 5.6.4. 


7.10 The Calculus of Variations and Maple 


7.10.1 Basic Euler-Lagrange Procedures 


In this section, we present some Maple procedures which are useful in finding and solving the 
Euler-Lagrange equations. Further, we apply these to various problems such as the problem of 
a marble rolling inside a (frictionless) hemispherical bowl under the influence of gravity. The 
result will be the plot of the motion of the marble. We also plot different modes of buckling of a 
beam under compression. First, let’s take a look at the Euler-Lagrange equation in one variable. 


> ELi:=proc(f) 

local parti,part2,p2,dfdx,dfddx; 
part1l:=subs({x=x(t) ,dx=diff(x(t) ,t)},diff(f,x)); 
dfdx:=diff (f ,dx); 

dfddx:=subs({x=x(t) ,dx=dx(t)},dfdx) ; 
p2:=diff(dfddx,t) ; 

part2:=subs (dx(t)=diff (x(t) ,t) ,p2); 

simplify (parti-part2=0) ; 

end: 


Notice that we have to do a bit of substituting to ensure that the answer comes out in a form 
Maple recognizes as a differential equation. You might ask, why didn’t we input the function f 
in differential form? The answer is just that we become used to writing f(t, x, x) =x? — x4? — 
2x sin(t) for the integrand of a variational integral. Rather than write x as d(x(t))/dt (where we 
also have to tell Maple that x is a function of f), it seems easier to write 


s f :=x72-dx*2-2*x*sin(t) ; 
f= x? — dx’ —2x sin(t) 


and understand ‘dx’ to mean x. Further, the procedure above, once we take the time to write it, 
takes care of making x a function of t and writing the final version in differential equations form. 
Here’s an example. 

> EL1(f); 


2 
2 x(t) — 2sin(t) + 2 (3 x(t)) = 0 
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Now, solve the Euler-Lagrange equation by 


> dsolve({EL1i(f) ,x(0)=1,x(Pi/2)=2},x(t)); 


x(t) = 2 sin(t) + cos(t) — ; cos(t)t 


Now consider the problem of extremizing the integral { x cos(t) + x?/2.dt subject to x(0) = 0, 
x(1/2) = 1. We write f2 below for the integrand to avoid confusion. 


> £2:=x*cos(t)+1/2*dx72; 
f2 := x cos(t) + a 
> EL1(£2); 
d 
cos(t) — (& (0) =0 


> dsolve({EL1(f£2) ,x(0)=0,x(Pi/2)=1},x(t)); 
x(t) = —cos(t) + 1 


Notice that the initial and final states are used to determine all arbitrary constants. 


Exercise 7.10.1. Carry out a Maple solution of the problem: extremize h(t, x, x) = x7/1° subject 
to the conditions x(1) = 2 and x(2) = 17. 


Exercise 7.10.2. Extremize h(t, x, x) = x? + 2x sin(t) subject to the conditions x(0) = 0 and 
x(1r) = 0. 


The following procedure handles the case where f does not depend on t. 


> EL2:=proc(f) 

local parti,part2,dfddx; 
part1:=subs({x=x(t) ,dx=diff (x(t) ,t)},£); 

dfddx:=diff (f,dx); 

part2:=diff (x(t) ,t)*subs({x=x(t) ,dx=diff(x(t),t)},dfddx) ; 
simplify (part1-part2=c) ; 

end: 


For example, 


> k:=dx72/x73; 
de 
ae 
> EL2(k); 
_(gxoy 


xe 
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> dsolve({diff(1hs(EL2(k)) ,t)=0,x(0)=1,x(2)=4},x(t)); 
16 


8 16 
9(2@--1+— 
(a) 


Maple used to give two answers here. One was incorrect as could be seen by computing the 
integrals for the integrand k. 


x(t) = x(t) = 


16 
t2—814+ 16’ 


> int(16/(t-4)°2,t=0..2); 


> int (16/(9*(t-4/3)~2) ,t=0..2); 


Now Maple 10 seems to want to give answers in terms of “Root__of”. To remedy that, apply the 
command “allvalues” to the dsolve command (or to “Root__of”). 

The following example shows how isoperimetric problems may be handled. The problem 
is to minimize J = f %? dt with x(0) = 2, x(1) =4 and subject to the constraint f x dt = 
1. Note that we use the Maple command “rhs” which takes the right-hand side of a Maple 
equation. 


> ki:=dx"2-lambda*x; 


> EL1(k1); 


> dsolve({EL1(k1) ,x(0)=2,x(1)=4},x(t)); 


At? a 
x(t) = -—- + [-—+2]1t4+2 


4 4 
> al:=rhs(dsolve({EL1(k1) ,x(0)=2,x(1)=4},x(t))); 
At? rN 
i - 2)¢+2 
a 4 +(F+ ) De 
> a2:=int(al,t=0..1); 
Xr 
2:=—+4+3 
a a 
> lam1:=solve(a2=1, lambda) ; 
lamI := —48 


> subs (lambda=lam1 ,dsolve({EL1(k1) ,x(0)=2,x(1)=4},x(t))); 
x(t) = 1217 — 101 +2 
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Exercise 7.10.3. Consider the problem of extremizing f x?/2 4 x %dx with x(0) = 0, x(1) =0 
and subject to iC x dt = 7/12. Follow the steps below and complete the problem. 


> gg:=1/2*dx"2+x*dx-lambda*x; 


] 
gg i= 5 de + xde—Ax 


da? 
-h- (G x(0) =0 


> xt:=rhs(dsolve({lhs (EL1 (gg)) ,x(0)=0,x(1)=0},x(t))); 


> ELi(gg); 


t een 
xti= x = 
2 2 


> solve(int(xt,t=0..1)=7/12, lambda) ; 
7 


Exercise 7.10.4. Use Maple to solve the problem of Example 7.6.2 by following the commands 
below. The final result is the curve depicted in Figure 7.3. 


v 


Lag:=1/2* dx*2-lambda*x; 
> EL1 (Lag); 


> de:=dsolve({ELi (Lag) ,x(0)=0,x(1)=0},x(t)); 


] 1 
de := x(t) = ZAP + oat 


> In:=int(rhs(de),t=0..1); 


> L:=solve(In=1/6,lambda) ; 


> xx:=subs(lambda=L, de); 


xxi=x(t)=—? +t 


> sh:=dsolve({D(x) (s)=cos(-s*2ts) ,D(y) (s)=sin(-s*2+s) , 
x(0)=0,y(0)=0},{x(s),y(s)}, type=numeric): 

> odeplot(sh, [x(s),y(s)],-5..7,view=[-1..2,-1..1], 
numpoints=400) ; 
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7.10.2 Buckling under Compression 


Now let’s consider how to calculate and depict the buckling states of a column under a compressive 
force P as in Example 7.4.5. We use the x-notation instead of the y-notation because that is how 
our Maple procedures are set up. The first three buckling modes are shown below. We begin 
with an Euler-Lagrange procedure for one independent variable with second-order derivative. 
We will not use this in the buckling problem, but it is useful in other problems where the square 
of curvature is extremized. 


> EL2ndOrd:=proc(f) 
local parti,part2,part3,dfdx ,dfddx,dfd2x ,d2tdfd2x; 
part1:=subs({x=x(t) ,dx=diff (x(t) ,t),d2x(t)=diff (x(t) ,t$2)}, 
diff (f,x)); 
dfdx:=diff (f,dx); 
dfddx:=diff (subs ({x=x(t) ,dx=dx(t) ,d2x=d2x(t)},dfdx) ,t); 
part2:=subs ({dx(t)=diff (x(t) ,t) ,d2x(t)=diff (x(t) ,t$2)}, 
dfddx) ; 
dfd2x:=diff (f,d2x); 
d2tdfd2x:=diff (subs ({x=x(t) ,dx=dx(t) ,d2x=d2x(t)},dfd2x) ,t$2); 
part3:=subs({dx(t)=diff (x(t) ,t),d2x(t)=diff (x(t) ,t$2)}, 
d2tdfd2x) ; 
simplify (part1-part2+tpart3=0) ; 
end: 
> kappa2:=d2x72/(1+dx72)~ (5/2) ; 
d2x? 
~ (1+ de?)62) 


> EL2ndOrd(kappa2) ; 


d d a 
(-29(2%) ($0) (H)- (FZ) (Fx) (Ge) 
da? d 2 da 3 d‘ a‘ d e 
+30 (S «) (= x() ~5 (S x) +3 (= x() +4 (= x() (= x) 
4 4 (9/2) 
+2(53x0) (G2) ) /(1+ (Ga) ) =" 


When the integrand f does not depend on x, then this can be simplified by integrating with respect 
to t once. This is the approach to buckling as in Example 7.4.5. 


> ELxindep20rd:=proc(f) 

local parti,part2,dfdx,dfd2x,d2tdfd2x; 

dfdx:=diff(f,dx); 

parti :=subs({x=x(t) ,dx=diff (x(t) ,t) ,d2x=diff (x(t) ,t$2)}, 
dfdx) ; 

dfd2x:=diff (f,d2x); 

d2tdfd2x:=diff (subs ({x=x(t) ,dx=dx(t) ,d2x=d2x(t)},dfd2x) ,t); 
part2:=subs ({dx(t)=diff (x(t) ,t) ,d2x(t)=diff (x(t) ,t$2)}, 
d2tdfd2x); 

simplify (-parti+part2=c) ; 

end: 
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Now let’s apply the procedure above to the integrand of Example 7.4.5. The first three buck- 
ling modes of the column under compressive loading are shown in Figure 7.8, Figure 7.9 and 
Figure 7.10. 


> buckle:=mu*kappa2-P* (sqrt (1+dx72)-1); 


a pd2x* ) 


> ELxindep20rd (buckle) ; 
7 (ss (Sx) (4x0) - (4x0) -3” (4x0) - 
3P (5 0) —P (= «) —2u (4 x0) om 
2 (San) (4x0) [i+ (a0) =e 


> Pxdiff (x(t) ,t)+2*murdiff (x(t), ‘$‘(t,3))=c; 


P g 2 cl = 
(5 (0) +2p (G x(0) =C 


Letting w? = P/(2), we obtain 


Small deflection theory gives 


> dsolve(diff (x(t) ,‘$‘(t,3))+omega*2*diff (x(t) ,t)=c,x(t)); 


qe -C2 cos(w t) ie -CI sin(@ f) = 463 

w w 
> eqi:=eval(-_C2/omega*cos (omega*t) + 
_Ci*sin(omega*t) /omegat+1/omega”2*c*t+_C3=0 ,t=0) ; 
eq2:=eval (diff (-_C2/omega*cos (omega*t)+ 
_Ci*sin(omega*t) /omegat1/omega~2*c*t+_C3,t)=0,t=0) ; 
eq3:=eval (diff (-_C2/omega*cos (omega*t) + 
_Ci*sin(omega*t) /omegat1/omega*2*c*t+_C3,t$2)=0,t=L) ; 

eq] := ee + .C3 =0 
a) 


eq? = C1 + =0 


eq3 := _C2cos(w L) w — _C] sin(w L)w = 0 


> solve({eq1,eq2,eq3},{_C1,_C2,_C3}); 


csin(w L) _ __e¢sin(@w L) 
~  @cos(@L)’ sw cos(w L) 


386 7. The Calculus of Variations and Geometry 


> x(t)=eval(-_C2/omega*cos(omega*t)+ 

_Ci*sin(omega*t) /omega+1/omega*2*c*#t+_C3,{_C2 = 
~c*sin(omega*L) /omega~2/cos(omega*L), _C3 = 
-c*sin(omega*L) /omega*3/cos(omega*L) ,_C1 = -1/omega*2*c}); 


(t) csin(w L)cos(wt)  csin(wt) ts ct c sin(w L) 
x(t) = ———_._. - — + 5 - as 
w cos(w L) wo w ow cos(wL) 
> x(t)=c*tan(omega*L) /omega*3*cos (omega*t)- 
1/omega~3*c*sin(omega*t) +1/omega*2*c*t- 
c*tan(omega*L) /omega“3; 
ctan(wL)cos(wt) csin(wt) ct  ctan(wL) 
a aa Sa SE LE 
w w w w@ 
> simplify (eval (c*tan(omega*L) /omega~3*cos(omega*t) - 
1/omega”3*c*sin(omegatt)+1/omega” 2*c*t- 
c*#tan(omega*L) /omega~3=0,t=L) ,trig) ; 


c(w L cos(w L) — sin(w L)) _ 


0 
w cos(w L) 


> -L*omega*cos(omega*L) tsin(omega*L)=0; 


—w L cos(w L) + sin(w L) = 0 


> tan (omega*L)=omega+*L; 


tan(aL)=@wL 
Now switch to x — y variables and take c = | and tan(w L) = L. 


> y:=1/omega*2*x-1/omega*2*L+L/omega*2*cos (omega*x) 
-1/omega”3*sin(omega*x) ; 


x L Leos(@mx) — sin(wx) 
w ow w? wo 


> plot ({tan(t),t},t=0..20,view=[0..20,0..20]); 


Figure 7.7 shows where tan(t) = t. These intersection points can be found using Maple’s “fsolve” 
command. Note that it helps to specify an interval (e.g., 0.5..5) in order to get a good floating 
point solution. 


> whL:=fsolve(tan(t)=t,t,0.5..5); 
WL := 4.493409458 
> yy:=subs(L=2, subs (omega=wL/L,-y)) ; 
yy = — 0.1981107313 x + 0.3962214626 — 0.3962214626 cos(2.246704729 x) 
+ 0.08817835680 sin(2.246704729 x) (7.10.1) 


> plot({x,yy,x=0..2],scaling=constrained,view= 
{0..2,0..0.6]); 
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20 


Figure 7.7. Intersection of ¢ and tan(t) 
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Figure 7.8. The first buckling mode 


We now take other solutions to tan(t) = ¢ to get other buckling modes. 


> wL2:=fsolve(tan(t)=t,t,2*Pi..5*Pi/2); 
WL2 := 7.725251837 


> yy2:=subs (L=2,subs (omega=wL2/L,-y)); 


yy2 := — 0.06702467232 x + 0.1340493446 — 0.1340493446 cos(3.862625918 x) 
+ 0.01735210029 sin(3.862625918 x) (7.10.2) 


> plot([x,yy2,x=0..2],scaling=constrained,view= [0..2,-0.2..0.3]); 
> wL3:=fsolve(tan(t)=t,t,3*Pi..7*Pi/2); 


WL3 := 10.90412166 
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Figure 7.10. The third buckling mode 


> yy3:=subs (L=2, subs (omega=wL3/L,-y)); 


yy3 := — 0.03364175274 x + 0.06728350549 — 0.06728350549 cos(5.452060830 x) 
+ 0.006170465406 sin(5.452060830 x) (7.10.3) 


> plot([x,yy3,x=0..2],scaling=constrained,view= 
[0..2,-0.1..0.2]); 


Exercise 7.10.5. Now apply Maple to the problems of the cantilevered beam (Exercise 7.4.6) 
and the thin elastic rod (Exercise 7.4.10). 


7.10.3 The Double Pendulum 


The following procedure gives the Euler-Lagrange equations for an integral of the form 


[t. x(t), y(t), dx(t), dy(t)) dt 


where dx and dy denote the t-derivatives z x and z y respectively. All functions f which are 


inputs to this procedure must then use this notation: t, x, y, dx, dy. 


> ELsystem:=proc(f) 

local ell,el2,part1,part2,part3,part4,p2,p4,dfddx,dfddy; 
part1:=subs({x=x(t) ,y=y(t) ,dx=diff (x(t) ,t) ,dy=diff(y(t),t)}, 
diff (f,x)); 
dfddx:=subs({x=x(t) ,dx=dx(t) ,y=y (t) ,dy=dy(t)},diff(f£,dx)); 
p2:= diff (dfddx,t); 

part2:=subs ({dx(t)=diff (x(t) ,t) ,dy(t)=diff (y(t) ,t)},p2); 
el1:=simplify(parti-part2)=0; 
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part3:=subs({x=x(t) ,y=y(t) ,dx=diff (x(t) ,t) ,dy=diff (y(t) ,t)}, 
diff(f,y)); 

dfddy:=subs ({x=x(t) ,dx=dx(t) ,y=y(t) ,dy=dy(t)}, diff (f,dy)); 
p4:=diff (dfddy,t) ; 

part4:=subs ({dx(t)=diff (x(t) ,t) ,dy(t)=diff (y(t) ,t)},p4); 
e12:=simplify (part3-part4)=0; 

el1,el2; 

end: 


Here, we consider the situation of Exercise 7.7.13. For the double pendulum, we get the following 
Euler-Lagrange equations. Here, the first pendulum is attached at one end to (0, 0) and has a bob 
of mass m while the second pendulum is attached to the first at the bob m and has its own bob 
of mass M swinging freely. The input / is the common length of the pendula. Of course, g is 
the gravitational constant 9.8 meters per second squared. The next small procedure gives the 
Euler-Lagrange equations for the system of the pendula. 


> ELeqs:=proc(m,M,1) 

local L,T,V,g; 

g:=9.8; 

T:=1/2*m¥1°2*dx"2 + M¥1°2*cos(x-y) *dx¥*dy + 
1/24*M*17 2% (dx* 2+dy~2) ; : 
V:=m*g*]+m*g*1* (1-cos (x) )+M*g*1* (2-cos(x)-cos(y)); 
L:=T-V; 

ELsystem(L) [1] ,ELsystem(L) [2]; 

end: 


Here are the equations for general m, M and 1. 
> ELeqs(m,M,1); 


a 
— 0.2000000000 /(49. m sin(x(t)) + 49. M sin(x(t)) + 5.m1 (S x) 


d - a 
+ 5.M Isin(x(t) — 1. y(t)) (4 0) + 5.M I cos(x(t) — 1. y(t)) (G 1) 


2 
+5.M1 (S x()) ) = 0, —0.2000000000 M (4s. sin(y()) 


d ' d? 
— 5.J sin(x(t) — 1. y(t)) (5 x() + 5.1 cos(x(t) — 1. y(t)) (Fa x(0) 


ad 
+ 5.1 (= ))) =0 


Now here is a procedure which solves the Euler-Lagrange equations numerically and then an- 
imates the corresponding motion. The m, M and / are as above while the initial conditions are 
given by x0 (the angle of the first pendulum from the vertical), yO (the angle of the second 
pendulum from the vertical), Dx0 and Dy0 (the initial angular speeds), T (how long the solution 
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should run) and N (the smoothness factor of the animation). 


> doubpend:=proc(m,M,1,x0,y0,Dx0,Dy0,T,N) 

local sys,desys ,xx,yy; 

sys:=(ELeqs(m,M,1) [1] ,ELeqs(m,M,1) [2]]; 
desys:=dsolve({sys[1] ,sys[2] ,x(0)=x0,y(0)=y0,D(x) (0)=Dx0, 
D(y) (0)=Dy0}, {x(t) ,y(t)},type=numeric, output=listprocedure) ; 
xx:=subs(desys,x(t)); yy:=subs(desys,y(t)); 
display(seq({plot([[0,0] , [l*sin(xx(T#t/N)) ,-l*cos(xx(T#t/N))], 
[l*sin(xx(T#t/N) )+l*sin(yy (T#t/N)) ,-l*cos(xx(T#t/N))- 
l*cos(yy(T#t/N))]] ,color=blue,symbol=box)},t=0..N) ,view= 
[-2*1..2*1,-2*1..1] ,scaling=constrained, insequence=true) ; 
end: 


In order to get still pictures, we write a slightly different procedure. Compare the two to see the 
difference. 


> doubpend2:=proc(m,M,1,x0,y0,Dx0,Dy0,T,N,s) 

local sys,desys,xx,yy; 
sys:=[ELeqs(m,M,1) [1] ,ELeqs(m,M,1)[2]]; 
desys:=dsolve({sys [1] ,sys[2] ,x(0)=x0,y(0)=y0,D(x) (0)=Dx0, 
D(y) (0)=Dy0}, {x(t) ,y(t)},type=numeric, output=listprocedure) ; 
xx:=subs(desys,x(t)); yy:=subs(desys,y(t)); 
pointplot([[0,0], [l*sin(xx(T*s/N)) ,-l*cos(xx(T#*s/N))], 
[1l*sin(xx(T+*s/N) )+l*sin (yy (T*s/N) ) ,-l1*cos (xx (T#s/N))- 

l*cos (yy (T*s/N))JJ , color=blue, symbol=box, connect=true, view= 
(-2*1..2*1,-2*1..1],scaling=constrained,xtickmarks=0, 
ytickmarks=0) ; 

end: 

> doubpend2(1,2,2,Pi/4,3*Pi/4,0,0,10,45,0); 

> doubpend2(1,2,2,Pi/4,3*Pi/4,0,0,10,45,3); 

> doubpend2(1,2,2,Pi/4,3*Pi/4,0,0,10,45,6) ; 

> doubpend2(1,2,2,Pi/4,3*Pi/4,0,0,10,45,10); 

> doubpend2(2,2,2,Pi/4,3*Pi/4,0,0,10,45,12); 

> doubpend2(2,2,2,Pi/4,3*Pi/4,0,0,10,45, 18); 


See Figure 7.11 for a sequence of pictures of the double pendulum. Note that the pictures go from 
left to right, then down to the next level and left to right again. For an animation do the following. 


> display (seq(doubpend2(1,2,2,Pi/4,3*Pi/4,0,0,30,100,i), 
i=0..100) , insequence=true) ; 


Exercise 7.10.6. Try out other combinations of m, M, x0 and yO and compare with a real double 
pendulum! 


Exercise 7.10.7. Carry through a similar Maple analysis (with animation) of the spring-pendulum 
system discussed in Example 7.7.12. 
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Figure 7.11. The motion of a double pendulum 


7.10.4 Constrained Particle Motion 


In Subsection 7.6.2, we discussed the motion of a particle constrained to stay on a surface under 
the influence of a potential (usually gravity). Here we want to use Maple to create the motion of 
the particle as in Figure 7.4. First, we parametrize the sphere. 


> sph:=<cos(u)*cos(v) |sin(u)*cos(v) |sin(v)>; 


sph := [cos(u)cos(v), sin(u) cos(v), sin(v)} 


Now we use the “EFG” and “ELsystem” procedures to create a new procedure that gives the 
Euler-Lagrange equations of motion for the particle (e.g., marble) in the spherical bowl. Note 
that the kinetic energy is given by the metric (with F = 0 as usual) and the time derivatives of 
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the parameters. The potential energy is taken to be the third coordinate of the parametrization in 
accordance with the gravitational potential mgh. 


> Equat:=proc(X) 

local Metric,TT,VV,LL,Eul,eqi,eq2; 
Metric:=subs({u=x, v=y},EFG(X)); 

TT :=1/2* (Metric [1]*dx72+ Metric[3]*dy~2); 
VV: =subs ({u=x, v=y},X[3]) ; 

LL:=TT-VV; 

Eul:=ELsystem(LL) ; 

eq1:=subs({x(t)=u(t) ,y(t)=v(t)},Eul[1]); 
eq2:=subs ({x(t)=u(t) ,y(t)=v(t)},Eul[2]); 
eqi,eq2; 

end: 


> Equat (sph) ; 


2 d ; d d? oe, 
— cos(v(t)) (- (= 0) sin(v(t)) (= “) + cos(v(t)) (a w)) = 0, 


d 2 a 
— cos(v(t)) (3 u()) sin(v(t)) — cos(v(t)) — (S “) =0 


These equations can be taken within a procedure and then solved numerically. The numeric 
solution can then be plotted to allow us to “see” the path of the particle. The procedure is 
analogous to “plotgeo”, the procedure for plotting geodesics. Of course, this jibes with Jacobi’s 
theorem relating particle motion to geodesics also. 


> plotmotion2:=proc(X,ustart,uend,vstart,vend,u0,v0, 
Du0,Dv0,T,N,gr,ori1,ori2) 

local sys,desys,ul,v1,U,V,motion,plotX,yyy; 
sys:=(Equat (X) [1] ,Equat (X) [2]]; 

desys :=dsolve({sys[1] ,sys[2] ,u(0)=u0,v(0)=v0,D(u) (0)=Du0, 
D(v) (0)=Dv0}, {u(t) ,v(t)},type=numeric, output=listprocedure) ; 
ui:=subs(desys,u(t)); vi:=subs(desys,v(t)); 

motion: =tubeplot (convert (subs (u=’ul?’ (t) ,v=’vi’ (t) ,X), list), 
radius=0.01,t=0..T,color=black, numpoints=N): 
plotX:=plot3d(X,u=ustart..uend,v=vstart..vend,grid= 
[gr[1],gr[2]],shading=XY): 

display ({motion, plotx},style=patch,scaling=constrained, 
orientation=[orii,ori2]); 

end: 


The following two commands produce the motions depicted in Figure 7.4. 


> plotmotion2(sph,0,2*Pi,-Pi/2,0,0,-Pi/4,2,0,12,100, 
[30,10] ,41,0); 
> plotmotion2(sph,0,2*Pi,-Pi/2,0,0,-Pi/4,2,0,12,100, 
[30,10] ,39,85); 
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Exercise 7.10.8. Now do Exercise 7.6.13. Namely, find the equations of motion of a marble in a 
paraboloid shaped bowl and plot the motion for varying c. Try 


> par:=[v*cos(u) ,v*sin(u) ,v72]; 


> plotmotion2(par,0,2*Pi,0,2.5,0,2,1,0,25,150, 
(30,12] ,0,60); 


Note that the parametrization is chosen so that “plotmotion2” can be used without 
modification. 


7.10.5 Maple and the Mylar Balloon 


In order to use Maple for the Mylar balloon, we must make a correction to its Elliptic E routine. 
In going from Maple 9 to Maple 10, an error was introduced into the procedure for Elliptic E. In 
order to correct this, give the command (due to Alec Mihailovs) 


> ‘evalf/Elliptic/E11_E‘ :=parse(StringTools:-Substitute 
(convert (eval (‘evalf/Elliptic/El1_E‘),string) ,"F_0","E_0")): 


Maple also seems to have changed its “simplify” command to its detriment. To partially correct 
this, define the command 


> mysimplify:=a->‘simplify/do‘ (subsindets(a,function, 
x->sqrt (x"2)) ,symbolic): 


and then apply it to any expression that is to be simplified further. 
The following is the parametrization of the Mylar balloon (see Section 7.9)of radius R. 


> MylarR:=[R*JacobiCN(u,kk) *cos(v) ,R*JacobiCN(u,kk)*sin(v), 
R/sqrt (2)*(EllipticE(JacobiSN(u, kk) , kk) /kk*2-(1-kk*2) /kk*2* 
EllipticF(JacobiSN(u,kk) ,kk))]; 


1 
MylarR := [R JacobiCN(u, kk) cos(v), R JacobiCN(u, kk) sin(v), ae /2 


(EuinicttaobisN kk), kk) (1 — kk) EllipticF(JacobiSN(u, kk), »)) 
ny > 


The profile curve of the balloon is depicted in Figure 7.5 and is created as follows. 


> profil:=plot([JacobiCN(u,1/sqrt(2)),1/sqrt(2)* 
(EllipticE(JacobiSN(u,1/sqrt(2)),1/sqrt (2) )/1/2-(1-1/2) /1/2* 
EllipticF(JacobiSN(u, t/sqrt(2)),1/sqrt(2))), 

u=-1.854074677. .1.854074677] ): 
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> poles:=textplot([[-.4,0.15, ‘North Pole‘], 
[-.4,-0.15, ‘South Pole‘]],color=black): 


> display({profil,poles}) ; 
So here are the commands that plot the balloon as shown in Figure 7.6. 


> plot3d(subs({R=1,kk=1/sqrt (2) ,u=u(t) ,v=v(t) } ,MylarR) , 
u=-1.854074677..1.854074677 , v=0. .2*Pi,scaling=constrained, 
shading=xy , lightmodel=light3, orientation=[45,69]) ; 


> plot3d(subs ({R=1,kk=1/sqrt(2) ,u=u(t) ,v=v(t)},MylarR), 
u=~1.854074677. .1.854074677 ,v=0. .13/8*Pi,scaling=constrained, 
shading=xy, lightmodel=light3, orientation=[-66,69]); 


We can also use Maple to calculate the metric coefficients, the second fundamental form co- 
efficients, Gauss and mean curvatures of the Mylar balloon. These of course verify the results 
in (Section 7.9) and the discussion immediately following. In the following, if Maple produces 
an un-simplified mess, apply the command “mysimplify” that was defined above. This should 
simplify the expression further. 


> EFG(subs({kk=1/sqrt(2)},MylarR)); 


2 
Roos 2 
a 0, —R* | —1 + JacobiSN (. 7) 


For instance, if the preceding command ends up un-simplified, type 


> mysimplify(EFG(subs({kk=1/sqrt (2) },MylarR))); 
> lmn(subs({kk=1/sqrt(2)},MylarR)); 


2 2 
R JacobiCn (1, 2) JacobiDN(u, 2) te. 


+ 0, 
5) 2 
bee JacobiSN(u, 2) 


2 
~) JacobiDN(u 2) (-14+ JacobisN( 2) ) J2 


2 
a\2 
p - sacobiSN(u, 2) 


> GK(subs({kk=1/sqrt(2)},MylarR)); 


R JacobiCN (« 


2 
2 
2 | —1 + JacobiSN (‘. +) 


R2 
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Of course this simplifies to 2cn?(u, 1//2)/R?. The mean curvature, which simplifies to 
3en(u, 1//2)/(2R), is found by 


> MK(subs({kk=1/sqrt (2)},MylarR)) ; 


2 2 
JacobiCN (. 4 JacobiDN ( ) af 2. 
2 2 
R |2— JacobiSN (. + 


A Glimpse at Higher Dimensions 


8.1 Introduction 


Up to this point, we have only considered surfaces x(u, v) dependent on two parameters. In 
mathematics and the sciences, however, it is often the case that geometric structures depend 
on many parameters. Indeed, the number of degrees of freedom of a physical system tells us 
precisely the number of parameters necessary to describe the so-called configuration space of the 
system. Just as we could use differential geometry to understand particles moving on constraint 
surfaces, so we would like to do the same for systems with many parameters. This means that we 
must invent a notion of higher-dimensional surface which mimics the properties of the geometry 
we are comfortable with, namely the geometry of two-dimensional surfaces. In this chapter, we 
will consider these higher-dimensional surfaces from a naive point of view with the goal of 
introducing the relevant notation and making the analogy with the two-dimensional situation. In 
this sense, this chapter is simply a dictionary for readers making the transition from the geometry 
of two dimensions to that of » dimensions. In particular, we will not introduce manifolds and 
their covariant derivatives in their most abstract generality, but instead stick (mostly) to the case 
of submanifolds of Euclidean space with the induced metric and covariant derivative. In this way, 
we hope the reader can be introduced to the geometry of higher dimensions while still maintaining 
some touch with reality. Basic references for this chapter are [dC92, Spi79, GHL90, Hic65] and 
(for connections with physics) [CM85]. 


8.2 Manifolds 


Just as for two dimensions, we may define a k-patch, or coordinate chart, x: D > R"* fora 
connected open set D C R*. Here we assume that 


i OG eet ark Es igt’)) 
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is smooth (1.e., all partials of all orders exist and are continuous), injective (with continuous 
inverse on its image) and the tangent vectors of parameter curves 


def ax! axtt! 
0;X = ry oS iaiy - 
du! du! 


are linearly independent (i = 1,...,k). We shall usually denote these tangent vectors, when the 
patch is understood, by 9; alone. A subset M* C R"*! is ak-manifold if it is covered by k-patches 
with the property that, for any two patches x: D, > R"t! and y: Dy > R"*! with non-empty 
intersection, the composition transition map 


x"! oy: y! (x(Dy) Ny(Dy)) > Dx 


is smooth. Of course, when we say “covered by k-patches”, we simply mean that every point 
of M lies in the image of some patch. Also, the k-dimensional tangent space to M at p € M is 
simply the subspace of R"*! spanned by the linearly independent vectors ;(p), ..., 9¢(p). The 
tangent space is denoted by 7, M just as for surfaces. 


Example 8.2.1 (The Sphere S” as a Manifold). 
The -sphere in R"*! is defined as 


gn def {(x!, ine xh? ee at (x"+1)? =1}. 


This is the direct analogue of S?, the 2-sphere in R?. Note that we call the sphere in R"+! the 
n-sphere instead of the n + 1-sphere. The reason for this terminology is that the sphere in R"+! 
is an n-manifold, not an n + 1-manifold. So, the appellation “‘n-sphere” denotes the dimension 
of the manifold itself, not the dimension of the ambient, or surrounding, space. Let us now show 
that S” is a manifold of dimension n. We must cover S” by patches which satisfy the “transition 
property” above. 

Now, stereographic projection St: S” > R” works in R"*! just as in R°. Namely, denoting 
the North and South poles by V = (0,...,0, 1) and S = (0,...,0, —1) respectively, we have 
the North and South projections 


1 n 
1 ntly _ — a 2 = 
Sty(x ye. Xx = (a oe) 
I n 
1 n+ly _ eT a ae so 
Sts(x',...,% = (ae on) 


Let x(u',...,u") = St7!(u!,...,u"): R° > S$" ~ {N} and y(w!,..., w”) = St (wi, ..., 
w"): IR” — S$” — {S} denote the respective inverses. Both x and y are smooth maps which are 
injective with smooth inverses (i.e., St) on their images. Also, it is easy to see that the parameter 
tangent vectors are linearly independent, so x and y are patches. Clearly, x and y cover S”, so we 
must only show the transition property to see that S” is an n-manifold. For this, note that x and y 
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have the explicit representations 


2u! 2u" Suge 
x(u',...,u")= ) yreey i) ’ ) 
Yui + | Yui +1 Yui +1 


2 1 Iw" i i2 
yeaa = ( = SES). 


14 owe?” 1+ Yo wi?’ 1+ dow? 
The transition map is then given by 
n Sty n Sts n 
R" — {(0,...,0)} —> 8S? —{N,S} —R 


and calculated to be 


1 n 
u u 

y'ox(u',...,u")= See a PH (wl... we"). 
du du 


Therefore, each component has the formula 


ui 
we —, 
Due 


and, since the origin of R” is precluded from the domain of the transition map, each component 
function is smooth. Thus, 5” is an n-manifold. 


i 


i = w 


u 


Exercise 8.2.2. Verify the formulas above for stereographic projection, x, y and the transition 
y~' ox and explain why the origin of R” is not in the domain of the transition. 


Now suppose that x: D, > R"*+! and y: Dy > R"*! are two patches with non-empty inter- 
section x(D,) 1 y(Dy) # @. Then, on the intersection, 


x(u!, ey) = y(w!,..., w*) 
and y~! ox(u',..., u*) = (w!,..., w*). Since each component function y* may be considered 
as a function of the u'’s, we have by the chain rule, 
dy ax* 
dul Bui 
7 * awi ay 
~ Let Bui Awi 
j=l 


This type of transformation rule is familiar to physicists who are interested in relating the physical 
quantities of one reference frame to those of another. Indeed, for this reason and because of the 
coordinate change formula above, physicists in the old days often defined the mathematical 
objects they were interested in (i.e., “tensors”) in terms of such transformation rules. If we write 
out the coordinate form of the formula above, we obtain 


ax! axtt! _ aw! ay! ay"t! A m aw* ay! ay"t! 
dui?” aut J Out Vw!” Ow! aul \awk’"? awk 7° 
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Written more compactly, we have 


and this formula shows that the coordinate change rule allows us to change the basis of the tangent 
space 7,M via the matrix 


aw! aw! aw! 
Out aut Ouk 
dw 
dul 
du! dui auk 


which we denote by J(u, w) since it is just the Jacobian matrix of multivariable calculus associ- 
ated to a coordinate change. This formulation allows us to make the notion of orientability precise 
(see Chapter 2). Say that a manifold M is orientable if there is a collection of patches {xa}aca 
which cover M such that the Jacobian matrices for all possible transitions x7' o xg have positive 
determinant (evaluated at all points in the overlap). Intuitively, we can think of this requirement 
as the analogue of saying that a rotation keeps an object oriented the same way while a reflection 


produces an oppositely oriented mirror image. One thing to notice is the following 


Proposition 8.2.3. Suppose that the intersection of two patches is path connected. In particular, 
suppose x(Dx) 1 y(Dy) # ©. Then the determinant of the associated Jacobian does not change 
sign when evaluated at any point in the intersection. 


Proof. We will give a rather sophisticated (as opposed to computational) proof here because the 
same ideas occur often in the geometry of higher dimensions. Choose two arbitrary points p and 
q in the intersection x(Dx) MN y(Dy) and take a path a: J > x(Dx) N y(Dy) C M with a(0) = p 
and a(1) = q. Now, just as in Lemma 2.1.3, we may write 


x(ul(t),..., u(t) = a(t) = y(w'(t), ..., w*(2)) 
with y~! ox(ul(t),..., u*(t)) = (w'(r), ..., w*(r)). In this way the Jacobian 


J(u, w) = Gr) 
u a(t) 


is dependent on ¢ as well and the determinant becomes a continuous function det J(u, w): 1] > R 
(since the partials are continuous). Now, J(u, w) € GL(k, R), the group of invertible k x k- 
matrices over R, so det J(u, w)(t) # 0 for all t. Therefore, by continuity (and the Intermediate 
Value theorem), the continuous function det J(u, w) cannot take both positive and negative 
values. CO 


Corollary 8.2.4. [fa manifold M is covered by two patches whose intersection is path connected, 
then M is orientable. 


Proof. We must only show that one single J(u, w) has positive determinant. Suppose it does not. 
Then, by either changing the sign of one coordinate function or interchanging two coordinate 
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functions in one of the patches, we change the sign of the determinant of J(u, w). This follows 
from the usual properties of determinants; namely, if a row (or column) is multiplied by —1 or if 
two rows (or columns) are interchanged, then the sign of the determinant is changed. Note that 
the connectedness of the intersection and the proposition above ensure that we must only check 
the positivity of det J(u, w) at a single point. 0 


Example 8.2.5 (The Sphere S”as an Orientable Manifold). 
For the patches x and y of Example 8.2.1, the transition function was calculated to be 


-1 ! my u! oe n 
y ox(u,...,u")= Sa SP =(w,...,w'). 
The Jacobian matrix may be calculated by first observing that 


dwi (owe?) 8) — udu 


dul (>), us?? 
where 5 =O if ij and 5 = 1 if i = j. (That is, 5: is the Kronecker delta.) By Propo- 
sition 8.2.3, we need to test the Jacobian at a single point, so choose (1,0,..., 0). Then we 
have 
-l1 0 -. 0 
0 f. #k&h 70 
J M0095, 
0 oOo .- J 


This matrix has determinant —1, so the given patches do not orient S". However, by Corol- 
lary 8.2.4, since the patch intersection is the path connected set 5" — {A’, S}, S” must be ori- 
entable. Indeed, by the proof of Corollary 8.2.4, we can redefine y to be a new patch 


2w 2w! 2u? 2w" ew) 
1+ we ewe 1 we a wee Tw 


obtained by switching the first two coordinate functions. The Jacobian matrix at (1, 0,..., 0) 
now has its first two rows switched, 


su!) = ( 


0 1 WK 0 
-1 0 -. 0 
JU, V)G,0.....0) = Sen tabs De 
0 0 ee ] 


and det J(u, v)c1,0,...,.0) = 1. The patches x and y therefore orient 5”. 


seney 


8.3 The Covariant Derivative 


From now on we will confine ourselves to the image of a single patch x: D C R‘ > R"™! 
on a k-manifold M*. Hence, from now on, we are doing Jocal differential geometry. From 
Proposition 8.2.3, we see that x is automatically orientable since D, and hence its image x(D) 
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are connected. For the patch x (which we may also refer to as M), the parameter tangent vectors 


9,,..., 0, form a basis at each point for the tangent space T,M. We may extend this basis for 
T,M to a basis for R"*! by choosing vectors, U;,..., Un+i—% in R"*! such that for all s and i, 
U,- 0; =0 
and 
U,-U, =6 


where 6! is the Kronecker delta. The U;’s are then normal vectors for M. If k =n, then only one 
normal vector U exists and, in this case, M is said to be a hypersurface. All surfaces in R? are 
thus hypersurfaces. Recall that a smooth vector field V on M is simply the assignment of an R"*! 
vector to each point of M so that V: M > R"+! is smooth. If the assigned vectors are always 
tangent to M, then V is said to be a tangent vector field. The vector fields 9;,..., 0, are tangent 
vector fields on M* while the vector fields U),..., Un41—% ate normal vector fields. 

For a vector field Z = (Z!,..., Z"*') in R"*!, the R"*!-covariant derivative is defined just 
as in the two-dimensional situation. Namely, 


ntl 
vez — Yo viz‘ ei 
i=] 
where e; is the standard i" basis vector for R’+! and v[-] denotes the usual R”*! directional 
derivative v[ f] = Vf - v. The covariant derivative for M is then defined to be the orthogonal 
projection of V®""' onto T,M, 
Vy Z = proj 7 yVR” Z 
where v € T,M, Z is defined on M and Z is a local extension of Z to an open set in R"*! 
containing M. Similarly, for a tangent vector field V, we may define Vy Z = proj 7, ae Z by 


*, atl = 
Vv Z(p) = proj TMV Vip) 2 


where V isa local extension of V as well. The following exercise shows that Vy Z is well defined. 


Exercise 8.3.1. Show that the definition of Vy Z does not depend on the extensions chosen for 
V and Z. Hints: (1) start by showing that Ve 2 = ve Z on M for V; and V2 extensions of 
V. Use the fact that V, = V2 on M. (2) Now show that VR" Z, = VE" Z, on M as well. 


From now on, for notational convenience, we shall dispense with distinguishing vector fields 
on M from their local extensions on open sets in R"+!. It is important to realize that the 
covariant derivative on M is simply the R"*! covariant derivative minus its U;-components, 
i=1,....m+1—-k. 


Proposition 8.3.2. Let V, Z and W be tangent vector fields on M and f : M — R bea function 
on M. The following are properties of the covariant derivative of M: 

(i) Vv(Z + W) = VWyZ 4+ VWyW; 

(ii) Vevd = f VvZ; 

(iii) WwfZ=Vif]Z+fVvZ; 
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(iv) V[(Z, W)] = (Vv Z, W) + (Z, Vy W), where (-,-) denotes the dot product in R"t! 
restricted to vectors tangent to M; 
(vy) VyZ—VzV =[V, Z], where [-, -] is defined below. 


Proof. To prove (i), it suffices to note that directional derivative and projection both preserve 
sums. For (ii), note that, by definition, 


phe = dU FVIZ 1e; = 1D; VIZ‘ Je; = fez. 


Then we take projections to obtain 
VevZ = proj nMViv- Z 
= proj T)M (rye Z) 
= f proj MV Zz 


since f(p) simply multiplies vectors by a scalar and, so, doesn’t affect projection. Then, by 
definition, 


Vila F vez 
For (iv), let us compute using the definitions of (., -) (i.e., dot product) and VF". 


V[(Z, W)] = V[_ Z'W'] by the definition of (., -) 


= > V[Z']wi + Ziv[w'] by the Leibniz rule 
R"*! Rt! ee Rt! 
=(Vy Z,W)+(Z,Vy W) by the definition of V 
= (VyZ, W) + (Z, VyW) 
because, since Z and W are tangent vector fields to M, all dot products with normal vectors 
(U;, W) and (Z, U;) are zero. Since VF" and V differ only by their normal components, the 


equality of the last two lines of the calculation above follows. Part (iii) is left to the reader in the 
exercise below and part (v) will be proved below after we discuss the bracket [-, -]. O 


Exercise 8.3.3. Prove the equality Vy fZ = V[f] Z + f VyZ in (iii) above. Hint: use (ii) as a 
guide. 


Now let us discuss the Lie bracket of vector fields. If V and W are tangent vector fields on M, 
then we define 


[V, WILf] = VIWLS]) — WIV TF). 


Here we are saying that [V, W] acts on a function f as a vector field should. We will 
show that this actually defines a tangent vector field on M by writing [V, W] as a linear 
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combination of the 0;. Also, to avoid a surfeit of square brackets, we will abuse notation a bit 


and write [V, W] = VW[f]— WV[f]. First, write V = )°; v'd; and W = >>, w/d; and then 
compute 


=W) vdiLf] 
= Me Jaf] + v W[9Lf) 


= 0 Slew! ajlv' aif] + v' w/a a;L FD) 
io 


j ou! ay ls . of 
=D (wei ae 
aul a du! dus 


and similarly, 


so that 


Ow! . av! af 
VW-WV = J mmm yy —— —_. 
( wn] ( (v oy =)) of 
In other words, we have 
’ Ow! aut 
= ig. be j — wi 
[V,W]= 2 a; where a= 2X (v aul w =| 
and the bracket [V, W] is then a tangent vector field on M. 


Exercise* 8.3.4. Apply the definition of bracket to prove the following properties: 
(1) [V, W] = -[W, V], 

(2) [aV + bW, Z] =a[V, Z] + b[W, Z] where a,b € R; 

(3) [[V, W], Z] + [[W, Z], V] +[[Z, V], W] = 0 (Jacobi Identity); 

(4) LFV, gW] = f)elV, W)+ fV[e]W — gWLf]V for smooth functions f and g. 

Now let us concentrate on proving (v) of Proposition 8.3.2. First, we will relate the covariant 


derivative and the bracket by a general formula which will eventually reduce to (v). To begin, 
note that [9;, 9;] = 0 for all i, 7. This follows because @;[f] = af/du', so that 


2 
(a;, 8/JLf] = 918,Lf1—aaffi= —2--—"F 9 


duldus du/ du! 
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since mixed partials are equal. Now, for V = >; vd; and W = > j w/a ;, we have 
WW => 0'Va, Yo w/a; 
i i 
= os Yo (v/d jw'd; + v! w'Va,9;) 
ij 
after switching i and j. Similarly, 


VwV = > > (wi djv'd; + wiv! Va, 3)). 
ij 


Then we have 


aw avi iG ro 
aN 2 ea) ye ee — wiv') V9,9; 


=[V,W]+ a > (v/w! — w/v) V9, 9;. 
joy 
To prove (v), we will show that the second term in this formula vanishes. 
Lemma 8.3.5. Va,9; — Va,9; = 9. 


Proof, From the general formula above applied to the R"*! covariant derivative V®""' with 
8; = >o, aXe, and 8; = 5”, b’e;, we obtain 


Ve aj — Ve 8; = [8;, 8j] + ) (a'd — bla*)VF'"e, = 040 = 0 
kl 


since [9;, 9;] = 0 by the discussion above and ve ei = 0 by definition of V®""’. Then, since 


var a, = ve 9;, the projections Va, 9; and V3, 9; must be equal as well. O 


Proof of Proposition 8.3.2 (v). Consider the general formula 


VyW —VwV =[V,W]+ 9° >> (vw! - wo!) Va,8; 
ij 


and focus on the last term 7; 5° j (viw! — w/v') Va, 9;. In this term, the factor in parentheses 
is antisymmetric in i and j while, by Lemma 8.3.5, the factor Va,d; is symmetric ini and j. 
Hence, summing over all i and j produces zero since a term and its negative always appear. 
Therefore, 


Vy W —VwV =[V, W]. 
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Suppose M” C R"*! is a hypersurface, so that there is only one (outward) unit normal vector 
field U on M. For tangent vector fields V and Z, we may write 
VR Z = Vy Z + (VEZ, U)U 
since Vy Z is the tangential projection of ve Z. Again note that V and Z must be extended to 
an open set in R"+! to make sense of VE" Z. 


Exercise 8.3.6. Show that the coefficient of the normal component above is (VE""' Z, U). 


We may write the equation above as 
VyZ = VR Zz —(VR"Z,U)U 
= VR'"'Z + (Z, VR" U)U 
since, by (iv), V(Z,U) = (VR"" Z, U) + (Z, VR""'U) and (Z, U) = 0. Then by symmetry of 
oe) 
VyZ = VE" Z — (S(V), Z)U 
where S(V) = —VE""'U is the shape operator of M in R"*!. The properties of Proposition 8.3.2 


(especially (iv) and (v)) may be used to prove the following generalization of Theorem 2.3.5. 


Theorem 8.3.7. The shape operator is a symmetric linear transformation on the tangent 
space. 
Proof. S is clearly linear by the properties of V&""" Because (U, U) = 1 is constant, 

0 = V{U, U) = (VE"'U,U) + (U, VEU) = 2(VR"'U,” U) 


and hence, (VE""'U, U) = 0. Thus, (S(V), U) = (— VE" U, U) =0, so S(V)(p) € TyM for 
all p € M. Now we must show that the linear transformation S is symmetric. Throughout the fol- 
lowing, we use the fact that (U, Z) = 0 for any tangent vector field Z as well as Proposition 8.3.2 
(iv) and (Vv). 


(S(V), W) = (- VEU, W) 
= (U, VR" Ww) by (iv) and (U, W) = 0 
=(U.[V.W]+VH VV) by(v) 
= (U,[V, W]) +(U, VR V) 
=0- (ve U,V) since [V, W] is tangent and by (iv) 
= (V,-VE"U) 
= (V, S(W)). 
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Example 8.3.8 (The Shape Operator on the Sphere S" of Radius R). 


In coordinates x!,..., x"*+!, the unit normal vector field for the n-sphere of radius R in R"t! is 
U — I (x! er") 
R ‘3 + 


The Euclidean covariant derivative is just the coordinate-wise directional derivative, so for a 
tangent vector field V = (V',..., V"*'), 


n+] 


S(V)\(p) = — VEU 
l 
=~ (V(pyx' hs V(pEe"*')) 


1 
= R (V'(p), sab oy v"*}(p)) 


eA) 
R 
where the third line follows since the x! are coordinate functions and we have 
: oxi ae : 
ij S{yiaeyisi ea 
Viel oa = vis) = Vi. 
j 


Hence, the shape operator of the n-sphere is completely analogous to that of the 2-sphere — it is 
simply scalar multiplication by the negative reciprocal of the radius of the sphere. 


While the shape operator for hypersurfaces plays a role similar to that for surfaces in R?, it must 
be realized that it can never assume as important a position in general for higher-dimensional 
manifolds. This is simply because hypersurfaces are rather rare among manifolds in higher 
dimensions whereas surfaces in R? are, by definition, hypersurfaces. For manifolds in R"+! 
which are not hypersurfaces, we may replace the shape operator, or more precisely (S(V), Z) U, 
by the more general second fundamental form. When we write, for tangent vector fields V and Z, 


Vv Z = proj, yVF"'Z, 


we are ignoring the normal component of VF""’. Let us now put it back in by writing superscripts 
T and N to denote the tangential and normal projections respectively; 
veo Z = (oe zy a ch z)% 
2 vyZ + BV, Z). 


B(V, Z) =(V8"" Z)¥ is called the second fundamental form of MF c R"*!, 


Proposition 8.3.9. B(., -) satisfies the following properties: 
(1) BUFV, Z) = f BUY, 2), 
(2) BV, fZ) = fB(V, Z), 
(3) B(V, Z) = B(Z, V). 
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The first two properties are included in the statement that B is bilinear; that is, linear in each 
variable separately. The last property is what is meant by saying that B is symmetric. 


Proof. We shall prove (2) and leave (1) and (3) to the following exercise. By definition, 
B(V, fZ) = VE" fZ—VyfZ 
=VIFIZ+ $V" Z — VIFIZ — fVvZ 
by property (iii) of Proposition 3.1. Hence, 
= fVP"Z — fVyZ 
=e, (vPn'z = VyZ) 
= fB(V, Z). 


0 


Exercise 8.3.10. Show that properties (1) and (3) hold above. Hints: for (1), use property (ii) 
of Proposition 8.3.2. For (3), use property (v) of Proposition 8.3.2 and the fact that [V, Z] is a 
tangent vector field, so [V, Z]” = 0. 


Now, by Proposition 8.3.9, at each point p € M, B(p) is a symmetric bilinear mapping 
T,M x T,M — N,M, where N,M = (U,,...,Un+i-4) 18 the normal space to M in Reet. 
From B we can define the mean curvature vector field to be 


k 
H(p) = > (VR a, 


s=] 


k 
= D> B(p)(as, 9s) 
s=] 
where 0), ..., 9, are assumed to be orthonormal at p and B(p)(0,;, 8;) denotes the evaluation 
of the bilinear form B(p) on the pair (@,, 9;). 


Exercises 8.3.11. Show that this definition of H, in the case of a two-dimensional patch x(u, v) 
in R?, is just twice our old notion of mean curvature. 


We will talk more about the second fundamental form after we discuss Christoffel symbols 
in the next section. A good general reference is [dC92]. It is important to note that what we 
have done in this section is not the most general definition of a covariant derivative. We have 
already said that manifolds may be defined without reference to an ambient Euclidean space, 
but it is also the case that covariant derivatives V (or connections as they are also called) may 
be defined without reference to an ambient Euclidean covariant derivative V®""’. The essential 
properties of an abstract covariant derivative are properties (i), (ii) and (iii) of Proposition 8.3.2, 
while properties (iv) and (v) of that result endow an abstract covariant derivative with the special 
names Levi-Civita connection or Riemannian connection. Every manifold (with given metric) 
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has a unique Riemannian connection and, for our manifolds M k C R"+! with the induced metric, 
the connection we have defined is plainly that Riemannian connection. 


8.4 Christoffel Symbols 


In Section 3.4, we introduced Christoffel symbols as the coefficients of a basis expansion for the 
second partials x,,, and x,, in terms of x,, x, and U. Recall that we assumed that the basis had 
F =x, -x, = 0 so that we obtained 

Kyu = Po yXu + Py Xv +lU 

Xuy = CiyXu + PYyXy + mU 

Xyy = U5Xy + UX + aU. 


We were able to identify (see Formula(s) 3.4.3) the Christoffel symbols in terms of the metric FE 
and G (with F = 0 remember); 


E 
Xuy = yom G6 +1U 
E G 
Xyy = xE™ + xG* +mU 
or Gs gatas +nU 
we op Get 


Now, X,, 1s nothing more than ves, and similarly for x,, and x,,. Also, removing the 
U-components /U, mU and nU from the expressions above then produces V,,x,, Vx,X, and 
Vx,Xy respectively. To generalize this to k dimensions, we simply write 


k 
ve a, — > T,ds + S > bi U, 
s=l t 


and, taking tangential components, 


k 
Va,9; = > Mijas. 
s=l 
Exercise 8.4.1. Use Lemma 8.3.5 to prove that 7, = Tj, for alli, j, s. 


Just as Christoffel symbols in the two-dimensional case may be expressed in terms of the 
metric, so too can higher-dimensional Christoffel symbols be expressed in terms of the metric 
(in traditional higher-dimensional notation) 


ij = (9;,8;) with  g =det(g;;) 


where (g;;) is the matrix of metric coefficients. Of course, we have g;; = gj; so the matrix is 
symmetric as well as being invertible. We also have need of the inverse of the matrix (g;;) which 
is denoted by (g'/). This inverse will allow us to isolate certain quantities as exemplified by the 


410 8. A Glimpse at Higher Dimensions 
proof below. In particular, note that we always have the relation 
s- & es 8js = OF 
i 
where 6” is the Kronecker delta. 
Proposition 8.4.2. The Christoffel symbols are determined by the metric as 


1 
Y= ee (8:85; + 9s8i; — 9;8si) - 


Proof. Property (iv) of Proposition 8.3.2 gives 


958i; = 95{9;, 9;) 
= (Va,9;,9;) + (8;, Va, dj) 


= (0181, 8;) + (85, > P41) 
1 
= 0 (Thay + Tau). 
I 
Similarly, 


9:8js = y (Tipats + Ti,8,0) and 98s; = 2 (M4 .8 + P48) 
i ; 
with 


9:81; + 9:8 js — Oj8si = d Ti,8jl- 
Now, multiplying by the mth row of the inverse metric matrix, we obtain 
; yo 8” (818s; + 958i; — 9/85) =u Ce een ete 
j j 
since 5); g/g j1 = 4)". Oo 
Corollary 8.4.3. For M = R"*!, the Christoffel symbols are all zero. 


Proof. The metric coefficients g;; are all constants, so their partials are zero. By Proposition 8.4.2, 
the Christoffel symbols vanish. O 


Example 8.4.4 (Christoffel Symbols in Dimension 2). 
Consider a two-dimensional patch x(u, v) = x(u!, u*) with F =x, - x, = 0. The metric matrix 


and its inverse are then 
sont FEY D) iy _ (I/E 0 
(gi) = C a (g | 0 i) 
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with determinant g = det(g;;) = EG. Then, for example, 


1 E 

rm =Th=5 = (Eu + By Ey+50= 55 
l E 

My =Th=7GOth-O+5-0= 5%. 
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Exercise 8.4.5. Show that, =}, = —G,/2E,T?, =1?, = —E,/2G,T?, =T?, = G,/2G 


andl’, = 13, =G,/2G. 


Since the formulas for the "’s above occur in the geodesic equations in two dimensions, it 
should be no surprise that they occur as well in higher dimensions. More generally, a tangent 
vector field V on M is said to be parallel along acurvea: I > Mé = x(u!,...,u*)if Ve V =0 
at every point on the curve. This condition may be translated into local coordinates as follows. 


Let V = )o v/a). 


dui avi 
=) 7 ras po Dey 
dui a 
“refer yprepns 
t Ss 


du' dvs du! 
= ideale cna Sa wea OF 
Ps dr Gui dp 28 


This expression is zero exactly when each component is zero. Hence, for each s we have 


dui av’ — du! ; 
eee Sa Tee SO, 
De oe ae 


Now, the first term }>; (du' /dt - dv'du') is precisely dv‘ /dt by the chain rule. Therefore, 


Proposition 8.4.6. V is parallel if and only if, for each s, 


+22 rte y =; 


Corollary 8.4.7. A curve a is a geodesic on M* if and only if 


au du’ dul 
— r?,—_——_ = 0. 
dt? 122 4 dt dt 
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Proof. We have a(t) = x(u'(t),..., u*(t)), so that a’ = }° (du/ /dt)d;. Thus, the geodesic con- 
dition V,a’ = 0 translates into the equation above by taking the formula of Proposition 8.4.6 
and substituting du/ /dt = v/. Oo 


Parallel vector fields always exist and are unique because the condition of Proposition 8.4.6 is 
a linear first-order differential equation. Here however, we cannot determine one specific angle 
function which describes the parallel vector rotation. The reason for this is apparent. It is only in 
the plane that rotations are determined by angles. In higher dimensions, it is the special orthogonal 
group SO(n), consisting of matrices A with det(A) = 1 and AA‘ = I, which describes rotations. 


Exercise 8.4.8. Define parallel translation as follows. Given Vo at p € M and acurve a on M 
with a(0) = p. Let V be the unique parallel vector field along a with V(0) = Vp. Then V(f) is the 
parallel translate of Vp. Show that this association is a linear transformation from T,M to Ty,)M. 
Further, this linear transformation is an isometry. That is, for parallel vector fields V and W along 
a, the inner product (V, W) is constant along a. It may also be shown that parallel translation 
preserves orientation and it is known that orientation preserving isometries are in SO(n). Hint: 
to show an isometry, use property (iv) of Proposition 8.3.2. 


In Chapter 7 we showed that geodesics arise as extrema for the energy integral { T dt where 
the kinetic energy is given by T = 1/2 |a’|?. Inhigher dimensions a’ = )~,(du' /dt) d; and, since 
we do not assume 9; - 8; = 0, we then have |a’|? = 5°, ; gi;14'u/. The appropriate energy integral 
we wish to extremize is then 


[ue u'(t), alk gf u*(t), w(t), .. . wnat = def / Yo gia! dt. 
ij 


Here we have dropped the superfluous 1/2. Generalizing Exercise 7.1.7 to k functions, for each 
s we obtain an Euler-Lagrange equation 


d {aL OL 0 
dt \aus) aus 
We can compute each piece directly as follows. 


aus =o Ges it! and aa = 20" + Dg 


with 


i (58) -¥ (oe Dla) + (ow ES 
= = 2D gio + 2 (+ ze) wa’ 


Hence, the Euler-Lagrange ae becomes 


d OL O85; O8is Ogi; Sin j 
& = Dae =f) 
dt (Gs) - ie aus = Dail oo Ge Pre iy ie 
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Now here is where we can use the identity }°, g”*g,; = 6 to isolate the second derivative 
™= >. >>, gg,iii' above. Sum the equation above by }~, g”* to obtain 


ms O85; d8is O8ij veo 
Ls 2 sisi += phlaet aul ~ ous wus | =0 
985i O8is O8ij ehcp 
oe 2 sisi + D3 Ls eo ( aul ta = aus uu’ = 0, 


which then reduces to 


i” + > Trai! = 0. 
y 
These are, of course, the geodesic equations. Therefore, in higher dimensions as well, the geodesic 
equations arise from a variational principle. 


Exercise 8.4.9. Show that, for an orthogonal two-dimensional patch x(u, v), the geodesic equa- 
tions above are those of Chapter 5. 


We now wish to show that certain normal coordinates may be chosen for which the covariant 
derivative is quite well behaved. This choice of coordinates will simplify our discussion of 
curvature in the next section greatly. 


Theorem 8.4.10. For a manifold M‘ and p € M, a patch x(u',..., u*) may be chosen about p 
with x(0,..., 0) = p so that 
8ij(P) = (8;(p), 9;(p)) = 5; and =—-Va,8(p) = 


These coordinates u',..., u* are called normal coordinates. 


Sketch of Proof. The details of a proof of this result require knowledge of the exponential map. 
Since we have not talked about this subject, we can only outline the ideas involved here. Take 
the tangent space at p and, for any vector v € T,M, map the line through the origin of T,M in 
the direction v to the unit speed geodesic a, through p in the v direction, 


tv a(t). 


For simplicity, assume M is complete, so that geodesics run forever. This definition then defines 


a k-patch from T,M to M. Choose an orthogonal coordinate system u',...,u* for T,M as 
coordinates for the patch x. By definition, g;; = 4; at p since the coordinates are orthogonal. 
Also, for a tangent vector v = (v!,..., v*), we have, 
i key — ea k 
x(u'(t),..., u(t) =a (t) = x(tu',..., tu’) 


so that it! = d?(tv')/dt? = 0 for all i. The geodesic equations, which of course hold on a, become 


2X Pr (ay(t))eid = 2 r7(a(t)u'vd = 0, 
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and this is true for all v € T,M. But this can only happen if all the I’; are zero. Hence, 


V2,9;(p) = )_T7(p)am = 0. : 


With the preliminaries above, we can end this section by giving a brief (and somewhat 
simplistic) sketch of the relationship between the higher-dimensional mean curvature vector field 
and higher-dimensional “area” minimization. Note how the use of normal coordinates makes 
the argument much simpler than it otherwise might be. The reader is recommended to [Law80, 
Theorem 1.4] and [GHL90, Theorem 5.19] for rigorous details. 

Suppose we have a manifold M (which we assume is rather local in the sense of being a 
single patch, say) and we vary it as in Theorem 4.3.4. Namely, we take M, = M + tU, where we 
assume U is a vector field which is always normal to M. For a set of coordinates (u',..., u*), 
the area element is given by 


dA=/gdu'... dué 


where g = det(g;;) is the determinant of the metric coefficient matrix. (In two dimen- 
sions recall that dA = VEG — F? du dv.) Similarly, for g, = det(g,; j) on M,, we have area 
element 


dA 
thee edi ga EO 


ve V8 
and the higher-dimensional area is then 
A(t) = v8 dA. 
M, V8 


As in Chapter 4, we want to find critical points for this area functional. Note that the integral 
above depends on the point about which we take our coordinate system, so let us fix p € M and 
let us take a normal coordinate system about p. Therefore, for these normal coordinates, at p, the 
tangent vector fields 8;,..., 8, are orthonormal and all the covariant derivatives Va,9; vanish. 
We can consider the normal vector field U as another local coordinate, so we have, for all i 
and j, 


[9;,9;] =0 and [U, i] = 0. 


ala) 


Because the coordinate vector fields are orthonormal at p, we have g = | there. Hence, 


Now we wish to calculate 


dA(t) 


dA. 
dt 


t=0 


4 ()| dee 
at\ Je) \n0 2 V8 lao 
_ 14g 
eee aes 
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Exercise 8.4.11. Let Q(t) = (q;;(t)) be a matrix whose entries depend on ¢ and for which 
Q(0) = I, the identity matrix. Show that 

dqi;(0) 

dt )° 


Hints: write out the definition of the determinant of Q(t), 


det Q = SoH) gio) "++ Pko(k) 


oeS, 


d 
Fi det Q(t)|,_) = tr ( 


where o is a permutation in the permutation group on k letters 5;,, and use the product rule to find 
dQ/dt. Then evaluate at t = 0 to make all terms in the determinant zero except for those having 
each factor as q,,(0) = | together with a single dq;;(0)/dt. Thus, the determinant sum reduces 
to the sum of dq;;(0)/dt’s and this is tr Q’(0). 


Since g = det(g;;), the exercise may be applied to the result above to produce 


4 () — Lyn Bau 
BN BI eh 2 Ok ye 
k 
1 Oi2e- 
~ 5 5y (Oi 91) |, 0 


where 4; is the vector field at level t determined by the local diffeomorphism m +> m +1tU. 


Then, 
k 
d (8 1 ie 
ae) = 9 Ui 8-9 
87 Ir=0 i=} 
since U is the t-direction. By property (iv) of Proposition 8.3.2, we then have 
k 
d /8r aa Rts Rt! s 
— (Vi 9, a;) + (8;, Ve 4; 
ae 2 3 TaN PO a 


by (v) of Proposition 8.3.2 and the vanishing of the brackets mentioned above. Hence, again by 


(iv), 
a () 


k 
= 0 4,(U, 5) |,_) — (UU, VF"'5)) |g 


t=0 i 


=! 


= S~a;(U, a;) — (U, Va,8;) — WU, (VE"a)  ) 
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since VR" = V +(VB""')N and, at t = 0, we are on M. Also, for t = 0, 
U and the 0; are orthogonal and the coordinates are normal, so 


5 (B)|_-D-w (ray) 


1=0 0 j=] 


k 


=-(U,>- (ve"'a,)") 


i=] 


= —(U, H) 


where H is the mean curvature vector field of M in R"+!. After integration, we have the 


Theorem 8.4.12. The derivative of the area functional at zero is given by 
dA(t) 
= | —(U,H)dA 
t=0 M 


dt 
where H is the mean curvature vector field and U is the normal variational direction. 


If M (i.e., t = 0) is a critical point for all variations, then we must have H = 0. This follows 
because any nonzero part of H would produce a nonzero (U, H) for some U and this then could 
be localized by a bump function to produce a nonzero integral, contradicting the assumption that 
M is a critical point of A(t). Thus, we have the higher-dimensional version of Theorem 4.3.4, 


Corollary 8.4.13. Jf M is least area, then the mean curvature vector field is zero. 


8.5 Curvatures 


In Section 3.4 we developed a formula for Gauss curvature which depended only on the metric. 
This led of course to Gauss’s Theorem Egregium. If we look closely at how the formula was 
derived, we see that we used the fact that x,» — X,», == 0. Written in our “covariant” notation 
(and noting that the covariant derivative with respect to a parameter curve is simply partial 
differentiation), we have 


ver yee X, a ye yee X, = 0. 
Furthermore, the derivation also reveals that we made essential use of the basis descriptions of 
U,, and U,, where U is the unit normal of the surface. In fact, it was these terms which provided 


the (In — m?)/EG = K inthe formula. Now, however, we must use M*’s covariant derivative V 
without the benefit of a unit normal. Therefore, we should not expect the corresponding quantity 


Va; Va, Os = Va, Va, 0s 
to be zero. Indeed, analogous to the derivation of the two-dimensional formula (but without the 
ingredients U, and U,,), we may tentatively define a vector field R(0;,9;)8; = }°, Ri js91 by 
R(Q;, 9;)d, = Va, Va,Os — Va, Va; 0,. 
The object R(-, -) will be called the Riemann curvature, but the formula on the right-hand side 
does not yet describe it in general. We want R to possess certain qualities. Namely, we want R to 
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be bilinear with respect to smooth functions. By this we mean that R(fX,Y) = f R(X, Y), 
R(X + Z, Y) = R(X, Y)+ R(Z, Y), RX, fY) = fR(X,Y) and R(X, Y¥ + Z) = R(X, Y) + 
R(X, Z) for any smooth function f: M — R. But, as the reader is asked to check in the 
following exercise, as yet, this is not the case! Moreover, we want R(., -) to be a linear map on 
vector fields as well. 


Exercise 8.5.1. Use the properties of covariant derivatives to show that 


R(;,9;) fas = f R(d;, 9j)ds. (1) 
R(f9;,9;)8, = 0; fVa,9s + f RO, 9;)as 
= —Viza;,a;195 + fR(G;, 0;)ds. (2) 


where the last line follows from Proposition 8.3.2 (ii) and Exercise 8.3.4 (4). Hence, R is not 
linear in the first (or second) variable. 


The last formula of the exercise gives a hint as to how to redefine R so as to achieve bilinearity. 
Namely, for tangent vector fields X, Y and Z, we have 


Definition 8.5.2. The Riemann curvature is defined to be 


R(X, Y)Z = Vixy1Z + Vy VxZ — VxVyZ. 


Exercise 8.5.3. Show that 
R(9;, 9;)a; = s Rijs91 


ar", sd as 
= De oul ~ il + UTE in ay Py in a. 
I 


Since, as we showed previously, all Christoffel symbols vanish in R"+', the expression above 
says that the Riemann curvature of R"+! is zero everywhere. We say that R”+! is flat. Further, 


show that 
def 
Rijsr dX Rijs8tr 


= » (3 - 5 et LTE im - ri) Sir 


and calculate R121 for a two-dimensional patch x(u!, u*) = x(u, v) with g1; = E, g12 = g21 = 0 
and g22 = G. Show that 


i eg | EeGu _ 1 (Bs EvGy | EvEy 1 (Gu) _ GuG 
ai 4E°G E\2G), 4EG? 4E°G E\2G), 4EG? 


and compare this with the formula above Exercise 3.4.5 to see that 


Ry212 
&§ 


K= 
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where K is Gauss curvature and g = det(g;;) = EG. Thus the Riemann curvature generalizes 
the Gauss curvature. 


Exercise 8.5.4. Suppose M" Cc R"t! is a hypersurface with unit normal U and shape operator 
S. Show that, for tangent vector fields X, Y and Z, 


R(X, Y)Z = (S(X), Z) S(Y) — (S(Y), Z) S(X). 
Hints: (1) write 
R(X, ¥) = (VB — Sx.) + (VF — Sv) (VE - Sx) - (VE — Sx) (VF - Sr) 
where Sy(Z) = (S(Y), Z). (2) Use the facts that VR"), + VE" VR’ — VR"! VR" — 0 (since 


[X.¥] 
R"t! is flat) and (VB""'U, U) = 1/2 ¥(U,U) = 0 (since (U, U) = 1 is constant). 


It is clear that Riemann curvature is a complicated object to deal with. We can gain some 
intuition about it however by considering vector fields in pairs. First, let us introduce the 
notation 

R(X, ¥, Z, W) © (R(X, Y)Z, W). 

Then we define the sectional curvature of the two-dimensional subspace of T,M spanned by 
X(p)and Y(p) to be 

R(X, Y, X,Y) 
(X, X)(¥, ¥) — (X, ¥)?” 
As the notation suggests, the sectional curvature is a Gauss curvature of a two-dimensional 
submanifold of M through p with tangent plane (X, Y). This generalizes Exercise 8.5.3, where 
K = Ri212/g = R(d1, 92, 8), 82)/g. Although we will not prove it, it is a fact that the sectional 
curvature actually determines the Riemann curvature. Of course the reverse is true by definition. 
Also note that if X and Y are orthonormal, then K(X, Y) = R(X, Y, X,Y). 


K(X, Y)= 


Example 8.5.5 (Sectional Curvature of the Sphere S” of Radius R). In Example 8.3.8 we saw 
that the shape operator of S” (of radius R) is given by S(X) = —X/R, where R is the radius of 
the sphere. From the formula of Exercise 8.5.4, we obtain 


R(X, Y)Z = (S(X), Z) S(Y) — (S(Y), Z) S(X) 
1 
= Ball, ZY — (¥, Z)X). 


The sectional curvature has an even more perspicacious appearance — especially when we 
take X and Y to be orthonormal. For then we have 


K(X, Y) = R(X, Y, X,Y) 


= ~5((X, X)Y —(Y, X)X, Y) 
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since (Y, X) =0 = (X,Y) and (X,X) =1=(Y,Y). Thus K =1/R? is the constant sec- 
tional curvature of S$”; that is, completely analogous to the constant Gauss curvature 
of S?. 
Exercise 8.5.6. Show that the Riemann curvature obeys the following symmetry relations: 

(1) R(X, Y, Z, Ww) = —R(Y, X, Z, W); 

(2) R(X, Y,Z, W) = — R(X, Y, W, Z); 

(3) R(X, Y, Z, W) = R(Z, W, X,Y). 


Hints: (1) (VyVxV, V) = Y(VxyV,V) —(VxV, VyV) and (2) (Vix.7]V, V) = 1/2[X, Y] 
{V,V), both by (iv) of Proposition 8.3.2. For part (3), use the first Bianchi identity 
below. 


Exercise 8.5.7. Prove the first Bianchi identity: 
R(X, Y, Z, W) + R(Y, Z, X, W) + R(Z, X,Y, W) = 0. 


Hints: W is superfluous, so drop it from the notation. Write out the definitions of the terms 
and group them to make use of the identity Vy Y — Vy X = [X, Y] etc. Then group the terms 
according to the respective Lie brackets which arise and apply the identity again to end up with 
only brackets in the formula. Apply the Jacobi identity (Exercise 8.3.4 (3)). 


There is a second Bianchi identity which will prove very convenient a bit later. This identity 
involves covariant derivatives of the Riemann curvature, so we must understand how these are 
defined. The covariant derivative of R is given by 

VR(X,Y,Z,W,V) =Vv R(X, Y, Z, W) 
=V[R(X, Y, Z, W)] — R(Vy X, Y, Z, W) — R(X, Vv, Z, W) 
— R(X, Y, VvyZ, W) — R(X, Y, Z, Vy W). 


Proposition 8.5.8 (The second Bianchi identity). 


VR(X, Y, Z, W, V) + VR(X, Y, W, V, Z) + VR(X, Y, V, Z, W) = 0. 


Proof. Choose normal coordinates (see Theorem 8.4.10) about a point p € M and test the 
identity at this arbitrary point. Let X, Y, Z, W and V be in the orthonormal coordinate basis 
with Vx Y(p) = 0 etc. Also note that brackets of these vector fields vanish as well by [X, ¥] = 
VxY¥ —VyX =0-—0=0. Then 
VR(X, Y, Z, W, V) = VyR(X, Y, Z, W) 

= Vy R(Z, W, X,Y) 

= V[R(Z,W,X,Y)] since Vy(-) =0 

= (Vy R(Z, W)X, Y) 
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by Proposition 8.3.2 (iv) and Vy Y = 0. Similarly, VR(X, Y, W, V, Z) = (VzR(W, V)X, Y) and 
VR(X,Y, V, Z, W) = (VwR(V, Z)X, Y). Hence, X and Y are superfluous, so we remove them 
from the notation. Then 


Vy R(Z, W) + V2R(W, V) + VwR(V, Z) =(Vy Vw — Vw Vy)Vz 
+(VzVy — Vy Vz)Vw + (VwVz — VzVw)Vv 


and, evaluating on X, we get (Vy R(Z, W) + VzR(W, V) + VwR(V, Z))(X) = R(W, V, VzX) 
+ R(V, Z, VwX) + R(Z, W, Vy X) since brackets are zero. But then, since VzX = 0 etc., we 
have 


(Vy R(Z, W) + VzR(W, V) + VwR(V, Z))(X) = 0. 


Hence, the result is proved. 0 


While the Bianchi identities may seem rather esoteric, they do simplify various calculations 
considerably as we will see below. 

Higher-dimensional differential geometry offers us both the challenge of understanding non- 
visualizable geometric phenomena and the opportunity to create new tools with which to study 
such phenomena. Some of these new tools are new types of curvatures which, in two di- 
mensions, become the Gauss curvature. We have already seen that sectional curvature can 
tell us information somewhat obscured by the Riemann curvature. Sectional curvature, how- 
ever, cannot be the final answer since, as we mentioned previously, it determines Riemann 
curvature. Instead, there is a general method called contraction which is available to us in 
higher dimensions and which allows us to isolate more tractable portions of the Riemann cur- 
vature. In order to do this, we will generalize the notion of a frame field which was used 
in Chapter 6. If €),..., € are vector fields defined on a neighborhood of a point p ¢ M* 
with 


(E;, Ej) = 5) 


at every point in the neighborhood, then the collection (E;,..., &} is called a frame about p. 
One way to obtain a frame is to choose normal coordinates around p (Theorem 8.4.10) and 
then parallel translate the corresponding vector fields along geodesics passing through p. By 
Exercise 8.4.8, parallel translation is an isometry, so the orthonormality of the frame is preserved 
as it is translated to other points in the normal coordinate patch. The one bothersome thing about 
using a frame is that we lose many of the coordinate formulas we obtained earlier. Nevertheless, 
we shall see that the advantages of a frame outweigh the disadvantages, so we assume that we 
have a frame about p in what follows. 


Definition 8.5.9. The Ricci curvature is defined to be 


k 
Ric(X, ¥) = )> (R(X, &)Y, Ei) 


ist 
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where X and Y are tangent vector fields on M*. The scalar curvature is defined to be 
k 
k=) Ric(E;, €)) 
j=l 
k 
= Do (RE; ENE). Ei). 


ij=l 


The first thing we should note is that these definitions are independent of the frame field we 
choose. The reason is this. Any two frames are related at a point by an orthogonal matrix; that is, 
a matrix A with A’ = A~!. This is equivalent to saying that the rows of A form an orthonormal 
set of vectors. This, together with the symmetry relations of Exercise 8.5.6 suffice to prove the 
invariance of the definition. Rather than give a proof of this in general, we will simply concentrate 
on the k = 2 case. So, suppose that {#;} is another frame field which is related to (€;} bya 2 x 2 
orthogonal matrix A = (a,,) as follows: 


€; = ay; Fy + arj Fo. 


Now we can just compute using the fact that the rows of A are orthonormal. That is, a,;@2; + 
@\2427 = 0, a}, +a}, = | and a}, +. a3, = 1. 


Ric(X, Y) = (R(X, E:)Y, E1) + (R(X, &)Y, E2) 
= (R(X, ay Fy + ani F2)Y, ay Fy + a21F 2) 
+ (R(X, a12F) + a22F2)Y, ayF) + ar2F2) 
= at, (R(X, Fy)¥, Fy) + aya (R(X, FY, Fh) 
+ a21a1,(R(X, F2)¥, F,) + a3, (R(X, Fa)¥, Fh) 
+ aj, (R(X, Fy)¥, Fi) + a12a29(R(X, Fi)¥, Fa) 
+ 224 (R(X, F2)¥, Fy) + a3y(R(X, F2)¥, Fo) 
=a? (R(X, Fi)Y, Fi) + 2ayiaz (R(X, Fi)Y, Fo) + a3, (R(X, Fa)¥, Fr) 
+ a?,(R(X, Fy), Fi) + 2arnara (R(X, Fi) ¥, Fo) + a5,(R(X, Fa)¥, Fr) 
using the symmetry relations of Exercise 8.5.6. Then 
Ric(X, Y) = (a?, + a?,)(R(X, Fi), Fy) 
+ 2(ay1421 + a12422)(R(X, Fy)Y, Fa) + (ad, + a3) )( R(X, Fe)¥, Fa) 
= (R(X, Fi)¥, Fi) + (R(X, F2)¥, Fa) 


by the equalities a),a2; + a124@22 = 0, aj, +.a?, = 1 and a3, + a, = 1. Hence, the Ricci curva- 
ture is well defined. 

The second thing to note is that both Ricci and scalar curvatures come about through a “trace- 
like” process. Of course, to take the trace of a matrix A = (a;;), we form 


trA= > aii = 2s (A(E;), E;) 
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for an orthonormal basis {€), ..., E€,}. Here, for Ric, we take two slots of the Riemann curvature 
and sum over the same elements of the frame. Similarly, and even more closely analogous to 
trace, to form «, we sum over the only available slots in Ric. This process of summing over 
two slots with the same frame elements is called contraction and is sometimes denoted C. Thus, 
Co4R = Ric and Ci Ric = « since, in the first case, we sum over the second and fourth slots 
and, in the second case, we sum over the first and second slots. We note here that some authors 
order the subscripts on Rj;1, differently, so contraction subscripts may have to be adjusted to 
correspond to ours. As the reader might expect, there is a more formal and precise notion of 
“contraction”, but we shall have no need of it here. 


Example 8.5.10 (Contraction of the Metric). 


(i) For a k-manifold M*, let us compute the (only available) contraction of the metric (-, -). For 
a frame as above, since (€;, €;) = 5/, 


(ii) Contraction can also be done by coordinates. Without going into details, for coordinates 


u',...,u* and basis tangent vector fields 0,,..., 8, with (R(d;, 9;)0;,9-) = Rijs, let 


Ris = e Rijsrg”” 
Pa 


These are then the components of Ricci curvature. Thus, by Exercise 8.5.3, 


ar, - . 
Ris = ye »- (a are we +2 rein — rie] girg! 
i 
ar. ee, ote 7 
a (Fs - ous mG ae Vist im — isl im Y- girg! 
ari, aly 
= & out + dX a a is ery :) 
t 


since )°, girgi” = 84. 


In order to see the power of these curvatures, we offer the following theorems without proof. For 
a proof of the the first, see [GHL90] for example. 


Theorem 8.5.11 (Myers’s Theorem). Let M* be a complete k-manifold and suppose that the 
Ricci curvature on all of M* is strictly bounded away from zero; that is, 


Ric 


Then M* is compact and the diameter of M® is less than or equal to mr. 
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Compare this result with Bonnet’s result, Theorem 6.8.16. Scalar curvature also has the power 
to constrain the type of a manifold. Compare the following result (due to Ros [Ros87]) to 
Alexandrov’s result, Theorem 4.4.6. 


Theorem 8.5.12. A compact hypersurface M" C R"*! of constant scalar curvature is a sphere 
5S" with metric induced by that of R"*", 


Exercise 8.5.13. For a surface defined by an orthogonal patch in R?, show that Ricci and scalar 
curvatures are given by 

Ric(X, Y) = K (X,Y) and kK =2K 
where K is Gauss curvature. Hint: use Exercise 8.5.3. The orthogonal patch condition is, in fact, 
unnecessary, but Exercise 8.5.3 assumed it for simplicity. 
Exercise 8.5.14. Show that the following formula holds relating the covariant derivative, the 
metric and the bracket. 

2(X, VzY) = Z(X,Y) + ¥ (X, Z) — X (Y, Z) + (¥, [X, Z]) + (Z, [X, Y]) — (x, [Y, Z]). 
Exercise 8.5.15. Here is a first small step toward getting away from the induced metric of R"*!. 
A metric (-, -) is a multiple of another metric (-, -) if 

(X, ¥Y) =A (X,Y) 


for all X and Y, where A is a constant. If A is allowed to be a smooth function on M, then the 
metrics are conformal. Denoting the covariant derivative and curvatures of the associated metric 
(.,-) by V, R, Ric, &, show that the following properties hold for a “multiple” metric. 


(1) Vx¥ = Vxl. (Use the formula from Exercise 8.5.14.) 
(2) R(X, Y, Z, W) =A R(X, Y, Z, W). (Use (1).) 


(3) R(X, Y)= . K(X, ¥). (Use (2).) 
(4) Ric(X, Y) = Ric(X, Y). (Use (2).) 


(S)k= oe. (Use (4).) 


Exercise 8.5.16. Let P* = {(u',..., u*) € R* | u* > 0} be the “upper half space” analogous to 
the two-dimensional Poincaré plane. Define an analogous metric on P* by 
vow 
(v, w) = ae 


In our present language, this means that 


= RR ei aes 5 
8 = (8) = Gane = Gaipe 
where the e; are the standard orthonormal Ré basis vectors. Hence, the metric is diagonal. The 
sectional curvature of P* is known to be constant at —1. The following steps lead to a partial 
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understanding of this result. For further and complete details, including formulas for general 
conformal metrics, see [dC92, Chapter 8.3]. 


(1) Show that the Christoffel symbols are 


l 
r= 5 3 8 (8:8 j1 + 8; 811 ~ 9/8:;) 
1 


1 rr 
= 58 (9:8 jr + 9j8ri — 9,8i;) 
and that the only non-zero ones are 


j 1 ‘ l ; 1 
Mya for j #k, M4 for i # k, rh =—z- 


(2) Now show that the components of Riemann curvature relevant to sectional curvature are 


ar, 
Rijij = De (tt -F oe ar + UTE sn _ ran) 8lj 
I 
ari, ary mpl mpl 1 
- (Fe ~ “Bui mS Pr; m VET im (ae) 


and the only non-zero components are Ryjxj = Risk = Rijiy = —1/ (u*)*. Here we assume 
kK # j in Rejgj, i H#k in Rigix and i # j,i #kand j Fk in Rijij. 


(3) Using the calculation of (2), show that the components of sectional curvature K;; = 
Riji;/(8ii + 8 jj) for the cases above are constantly equal to — 1. This exercise indicates how 
the conformal metrics of Chapter 5 may be generalized to higher dimensions. 


Exercises 8.5.17. A manifold M¢ is said to have nonnegative sectional curvature if K(X, Y) > 0 
for all X and Y. The Ricci curvature is said to be nonnegative when Ric(X, X) > 0 for all X. 
Show that nonnegative sectional curvature on M implies nonnegative Ricci curvature on M and, 
in turn, this implies nonnegative scalar curvature on M. Hint: First prove the formula 


k-1 k-1 
Ric(X, X) = (X,X) }> K(X, &)) = (X, X) )> K(Ex. €)) 
j=l j=l 


for a frame {Ej} spanning X+, the space of vector fields orthogonal to X, and & = X/|X|. 


Exercise 8.5.18. Here is an exercise which at once goes beyond our definitions and tests them. 
A Lie Group G is a smooth manifold with a smooth associative multiplication G x G > G 
having an identity e and inverses. Take G to be compact and connected. For example, the special 
orthogonal group SO(n) is a compact connected Lie group. We may define certain vector fields 
on G by fixing a vector X(e) € 7,G and then taking X(g) = g,(X(e)) € T,G. Here, g, is the 
map induced on tangent vectors by the multiplication (g, h) > g-h forall h € G. These types of 
vector fields are called left invariant vector fields. Lie groups are quite symmetric in the sense that 
the covariant derivative (which may be defined without reference to an ambient Euclidean space, 
but which has all the properties of Proposition 8.3.2) and curvature at the identity determine 
the covariant derivatives and curvatures at all other points of G. Furthermore, it is known that 
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VxX = 0 for X any left invariant vector field. Thus, curves whose tangent vectors belong to left 
invariant vector fields are geodesics. Also, for G as above, the following identity is known to hold: 
([X, Y], Z) = —(Y, [X, Z]) for left invariant vector fields X, Y and Z. Here, (-, -) is a metric 
which is first defined on 7,G and then transported around G by the Lie group multiplication. 

Now, for a compact, connected Lie group G with left invariant vector fields X, Y and Z, show 
that 


] 
(1) VxY¥ = 3 [X, Y]. (Consider Vxy_y(X — Y) and use Proposition 8.3.2 (v).) 


] 
2) R(X.Y)Z = 5 (LX, ¥},Z]. 
(Start with the definition of R, then use (1) to express R completely in terms of brackets. 
Finally, use the Jacobi identity for the bracket (Exercise 8.3.4 (3)).) 


(3) If X and Y are orthonormal, then K(X, Y) = ; ILX, Y]|’. 
Thus, sectional curvature, and hence, Ricci and scalar curvatures, are nonnegative for a 
compact, connected Lie group. (Use the definition of K, (1), Proposition 8.3.2 (iv) and the 
identity ([X, YJ, Z) = —(Y, [X, Z]) with Z = Y.) 


The Killing form of a Lie group G is defined to be b(X, Y) = tr(ady ady), where adz(W) = 
[Z, W] is a linear transformation of the vector space of left invariant vector fields. It can be 
shown that b is symmetric, bilinear and invariant under all automorphisms of G. A compact Lie 
group with nondegenerate Killing form is said to be semisimple. The form b is nondegenerate if 
b(V, W) = 0 forall W implies V = 0. In this case b is a negative definite form (i.e., b(V, V) < 0 
for all V # 0) and —d is actually the (bi-invariant) metric (-, -) on G. That is, (X, Y) = —b(X, Y). 
For instance, SO(n) is semisimple. Using (2) above and the definition of adz, show that a 
semisimple Lie group with metric b has Ricci curvature 


Ric(X, Y) = -3 b(X, Y). 


To end this section, we wish to make several calculations using the definitions above to try 
to understand a tiny bit of modern physics. Einstein’s general theory of relativity is based on a 
principle that energy and momentum distort the geometry of spacetime and, as a consequence, 
are responsible for various physical phenomena such as light “bending” around the Sun (along a 
spacetime geodesic) and the precession of the perihelion of Mercury. Although this principle is 
simple to state in our naive discussion, the precise form of the relationship is far from clear. If 
we are to believe in the relationship at all, then perhaps the simplest such formula is 


G=cT 


where G is some type of curvature (called the Einstein curvature) and T is a quantity (called the 
stress-energy tensor) which depends on the amount of energy-momentum in a particular region 
of spacetime. In particular, in a vacuum we have T = 0 — but interesting physics (such as the 
bending of light) can still occur. If G were Riemann curvature R, then the formula G = c T 
together with the assumption T = 0 would imply the vanishing of Riemann curvature. But, as we 
have said earlier, R = 0 implies that a manifold is flat. Hence, spacetime would be forced to be flat 
and this would contradict the interesting geometrically induced physics observed. So, G cannot 
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then be the Riemann curvature. The next logical choice to try is the Ricci curvature, so we take 
G = Ric provisionally. But we must ask, what else do we desire in a physical quantity? Certainly, 
the law of conservation of energy-momentum must be obeyed, so we obtain the vanishing of 
some sort of divergence of T. This means that we should also require G to be divergence free as 
well. Let us consider this now. 

If V =(V!, V2, V°) is a vector field in R? with coordinates x', x? and x3, then we know 


dV! av? av? ns 
qe aa snes V,e;) 


where the e; are the usual Euclidean orthonormal basis vectors. 


Exercise 8.5.19. Verify the last equality 


av’ av? av3 R 
oat + ae ta = Le Vo 


In the same way, on a manifold M*, we can define the divergence of vector fields and bilinear 
quantities with two slots (i.e., 2-tensors) such as the metric (-, -) and Ric. Let A(-, -) be such a 
quantity and define the divergence of A, div A(-), to be a linear quantity with one slot (i.e., a 
1-form; see Section 8.6), 


div A(X) © S~ Ve, A(E:, X) 
= > EA(E, X) — A(Ve,€;, X) — ACE, Ve,X) 


where {€;}{_, is a frame and we use a definition for VA similar to that for VR in the second 
Bianchi identity Proposition 8.5.8. There are several things to note here. First, the divergence 
produces a “tensor” which is completely determined at each point by the values of its constituents 
at that point. Also, although we have used a frame to define the divergence, this is in fact only a 
convenient way to compute — the divergence may be defined without reference to a particular 
frame used for computation. With this in mind, consider 


Example 8.5.20 (The Metric (-, -) has Zero Divergence). 

To see this, take normal coordinates about an arbitrary point p € M and extend to a frame 
{E,,..., &} by parallel translation along geodesics emanating from p. In particular, we have 
V¢,E; = 0 at p. Now, because the divergence is determined by the values of the frame etc. at p, 
the same calculation as below may be made for a normal coordinate frame about every point in 
M. At p we have 


div (-, )(E;) = ys Ve, (Ej, Ei) 
i 
= > EEE — (Ve,E;, E:) — (Ej, Ve, Ei) 
i 


= OEE; €) -0-0 
i 
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since we have normal coordinates at p. But the first term vanishes as well because (€;, €;) = 8 
on the coordinate neighborhood and the €;-derivative of a constant is zero. Therefore, 


div (-, -)(E;) = 0. 
Since this is true for all €; and the process is linear, then it holds for all vector fields as well. 
Exercise* 8.5.21. For a smooth function f: M — R, show that 

div(f(-,-)) =df 


where df is the differential of f defined by df(X) = X[f] for a vector field X. Thus, the 
divergence of conformal metrics (i.e., f > 0 at every p) depends on the conformality factor 
alone. Note that this makes sense in light of Example 8.5.10 since the divergence depends on the 
frame chosen. Verify that, for metric f(-, -), the frame {(1/./f) €;} gives div(f (-, :)) = 0. 


We will now apply the divergence to Ricci curvature to find a remarkable relationship. 


Theorem 8.5.22. For Ricci curvature Ric and scalar curvature «, 
dk = 2 div(Ric). 


Proof. Again choose a frame derived from parallel translation along geodesics of normal coor- 
dinates about an arbitrary point p ¢ M. Hence, at p, Ve,E; = 0 for all 7 and j. Then 


2 div(Ric)(E,) = 2 } > Ve, Rio(E;, Es) 


J 


= 25° Ej Ric(E;, &) 
j 


=2 So ERE; ENEs, €;) 
ij 
= D0 Ej (RE, E)Ess Ei) + D> E(RE:, EjEs, Ej) 


ij ij 


where the second term is obtained by switching i and j. Then, the symmetry relations for R 
imply 


2 div(Ric)(E,) = )° Ej (R(Ej, Es, Ei) + DE (RUE; EE), Es) 
fy ij 
= Do VR(Ej, Ei, Ess Eis Ej) + VRE}, Eis Es Eos Ei) 
ij 
= — SO VR(E;, EE, €), Es) 


ij 
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by the second Bianchi identity. Then, interchanging the first two coordinates, 


2 div(Ric)(E,) = )> VR(Ej, Ej, Ei, Ej, Es) 
ij 
= SOE RE, Ej, Ei, Ej) 
ij 
= Es ye R(E;, ox &;, E;) 
ij 
= E,[«] 


= dk(&). 


Since this is true on a basis, it is true in general. O 


Exercise 8.5.23. The manifold (M*, (-,-)) with metric (-,-) is an Einstein manifold if 
Ric(X, Y) = A (X, Y) fora fixed constant A and all vector fields X and Y. 


(1) Show that, if Ric(X, Y)(p) = A(p) (X, Y)(p) for a smooth function 
4: M* + R with k > 3, then Mé is Einstein. Hints: (1) first compute « and (2) then use 
Theorem 8.5.22. 


(2) Show that a semisimple Lie group with Killing form metric —b is an Einstein manifold. 
Hint: use the last part of Exercise 8.5.18. 


Now, what has been the point of all this? The argument we presented earlier suggested that 
the Einstein and Ricci curvatures should be identified and that the Einstein curvature should be 


divergence free. But, as Theorem 8.5.22 shows, in general, the Ricci curvature has a nonzero 
divergence. So what is to be done to rescue the idea? By Exercise 8.5.21, we know 


div(k(-, -)) = dk 
so that div(« (-, -)) = 2 div(Ric) and, therefore, 
div{ Ric (-,-) 0 
—-K(-+,- = Q. 
2 


This calculation then says that, if we define the Einstein curvature to be G = Ric -} «(-,-), then 
we obtain a type of curvature which is divergence free. Also, we have 


Theorem 8.5.24. The Einstein and Ricci curvatures determine each other. In particular, G = 0 
ifand only if Ric = 0. 


Proof. Denoting contraction by C, we may write 
G = Ric (+) 
= Ric-~k (-,- 

2 


= Ric -; C(Ric) (-, +) 
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since the contraction of the Ricci curvature is scalar curvature by the discussion preceding 
Example 8.5.10. Of course, Example 8.5.10 itself shows that C((-, -)) = dim M, so we also 
have 


C(G) = C(Ric) — > C((-, -)) 
= KR z dim M 


_(,_dimM) | 
= — )k 


Now, spacetime has four dimensions — time and R? — so dim M = 4. Hence, C(G) = —« and 


1 
Me SCs EAs) 


1 
=G-— 3 C(G) {-,-). 


Thus, Ric and G determine each other. im 


Therefore, it seems that the Einstein curvature is a good choice for the Einstein field equation 
G = 8nT (where the constant 87 is determined by taking the Newtonian limit). Indeed, in a 
vacuum, where T = 0, it is generally more convenient to solve the equations Ric = 0 rather than 
G = 0,so the relationship with Ricci curvature is important. Perhaps we should mention what the 
word “solve” means here. What is known and what is unknown? Generally, we try to determine 
the geometric structure of spacetime from a hypothesis on the Ricci curvature and a knowledge 
of some facet of the relevant geometry. In Example 8.5.10 we showed that the Ricci curvature is 
given in terms of Christoffel symbols and their derivatives. These, in turn, are given in terms of 
derivatives of the metric. Hence, the expressions 


Ri, =90 


form a system of second-order partial differential equations in the metric. Therefore, when we 
solve this system, we are determining the metric in that region of spacetime. Since the metric 
determines everything, it can be safely said then that we understand the geometry of spacetime 
in that particular region. 


Example 8.5.25 (The Schwarzschild Solution). The solution (i.e., the metric coefficients) for the 
vacuum field equations R;, = 0 outside a spherically symmetric body of mass M is given by 


2M 2M\"' 
800 = 1—- ae? s=— (: Pe =) , 82 =r, 833 = —r’ sin’, 


where time ¢ is usually given the index 0. This is the Schwarzschild solution discovered by 
Schwarzschild in 1916. Note that this metric is different from our usual ones in the sense that 
it is not positive definite. Spacetime is, in fact, based on the flat (Minkowski) metric of special 
relativity which has the form goo = 1, g1; = —1, g22 = —1 and g33 = —1. These metrics are said 
to be semi-Riemannian metrics and much of the theory of ordinary Riemannian geometry carries 
over to these metrics as well. 
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There are many good texts which discuss the material of this section in great detail. In 
particular, for general discussions of higher-dimensional geometry, see [Spi79, GHL90, dC92]. 
For a particularly elementary discussion of the relativistic bending of light and the precession of 
Mercury’s perihelion, see [Fab83]. For general relativity itself, see the classic [MTW73] and for 
a beautiful combination of differential geometry and relativity, see [O’N83]. 


8.6 The Charming Doubleness 


But the beauty here lay in the duality, in the charming doubleness... 


— Thomas Mann (Felix Krull) 


Everything that we have done above has a dual formulation which is often powerful and quite 
beautiful. First, recall that any (real, say) vector space V has a dual vector space V* = {f: V > 
R | f is linear} consisting of linear functionals on V. In terms of a basis for V, {v;,..., Vx}, the 
dual vector space has basis {¥', ... , ¥} with the defining property 


v(v;) = 6). 


In the same way, if the v; are tangent vector fields which form a basis at each point p € M, then 
their duals are called 1-forms and are denoted by 6’. Of course, the 1-forms 6' dual to the @; have 
the defining property that 6'(9 ;) = oy as well. If we take the vector field basis to be {9;}, then a 
dual 1-form 6’ is usually denoted by du' to indicate that coordinate vector fields 0; = 0/du' are 
in use. 

Now, a linear transformation f: V ® V > V with f(v; ® vs) = >, I'v,v, may be described 
by writing f: V > V* @ V with f(v;)(v,) = f(v; ® V,). Perhaps we should remind the reader 
here that the symbol ® stands for tensor product. The tensor product of two vector spaces V and 


W with bases {v),..., V.} and {w),..., W,} is formed by taking the vector space V ® W with 
basis 
{v; @ w;} 
where i = 1,..., mand j =1,...,n. Of course we may also write 
fi) = yo aj,v @ V, 
iv 


where the ai, € R and, since there is a dual basis element in the formula, we may evaluate at v, 
to get 


Fis) = Y_ aj;¥ (v5) v, 
je 
= » ay;Vr- 
But, f(v;)(v;) = f(v; @ Vs) = ba I‘,v, and, comparing the two formulas, we see that 
ay; = r, 


and this relation holds for all i and s. Hence, f(v;) = )~ is IY,7 @v,. 
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We can now do the same thing for the covariant derivative. More precisely, if V denotes 
tangent vector fields on M, then the covariant derivative is an R-linear map V: V @ V — V. Just 
as above, this map may be rewritten in a dual form as V: V > V* @ V with 


Ves y we, 


where {£),...,€} is a frame on Mé and the w; are 1-forms. Note that we omit writing the 
tensor product symbol ® for convenience. Also, now that we see that this is simply a different 
description of the same covariant derivative, we can dispense with V and simply write V. Let us 
identify the 1-form coefficients in terms of something we know. Let {0), ..., 0} denote the dual 
1-forms to the frame {€),..., &} and consider the following calculation analogous to the vector 
space calculation above. (We use symbols I* to denote coefficients in a basis decomposition even 
though the chosen frame may not be a coordinate frame. Therefore, the [*’s are not the usual 
Christoffel symbols in general, but, in case €; = 0;, then r= r.) 


a PE, = Ve, Ei 
= V(E;)(Es) 
= Oven: 
dv 
by the vector space formula above. By the 1-form formula, however, we have 
V(EMEs) = D> oF (Es) E 


so 


> [oP OE) | & = Do oF (EE. 
r j r 
Since this holds for all s, 
>> h,0/ = af. 
j 
Therefore, the connection |-forms w; may be written in terms of the dual frame and the coefficients 
of the covariant derivative’s basis expansion. 


Exercise 8.6.1. Show that 


he — Ber = (6, E] 
I 


so that there is no reason, in general, that Mi, = i ;- Hint: Proposition 8.3.2 (v). Also, show that 
f*, = —I'i,. Hint: Proposition 8.3.2 (iv). 


Exercise 8.6.2. Show that 


Hint: Proposition 8.3.2 (iv). 


432 8. A Glimpse at Higher Dimensions 


Riemann curvature is defined in terms of “second covariant derivatives”, so if we want to have 
a dual version of this as well, then we must know how to differentiate 1-forms. For a moment, 
consider a function f of k-variables u!, ..., u* and compute its differential to be 


df= y haw 


The du’ are 1-forms dual to the vector fields 9;, so df is itself a 1-form. This calculation from 
calculus tells us that the derivative of a 1-form should be a 2-form. Now, just as a 1-form acts on 
vector fields, a 2-form should act on pairs of vector fields. 


Example 8.6.3 (Wedge Product). 
Let 6 and @ be 1-forms. Define the 2-form 6 A @ by how it acts on a pair of vector fields V 
and W: ; 


9 AG(V, W) = OV) - d(W) — O(W) - OV). 
This “product” @ A ¢ is called the wedge product of \-forms. Note that 1-forms have the property 
that @ A ¢ = —@ A@. In particular, this means that 6 A 6 = 0 for all 1-forms @. 
To conform with the definition of df above, we define the exterior derivative of a 1-form 
f du’ to be 
af 


—— du! a du 
du/ 


d(f du’) =} 


where the u/ are coordinates. This is equivalent to defining d by the formula 
d6(V, W) = V[6(W)] — W[a(V)] — a(LV, W)) 


for any two vector fields V and W. 


Exercise 8.6.4. For coordinate vector fields V = 0; and W =49,, show that the formula 
d6(V, W) = V[6(W)] — W[A(V)] — a([V, W)) gives 


d(f du') = a, f 5! — a, fd! 


where we may assume j <r. Explain why this is the same as the coordinate formula for 
d above. 


Theorem 8.6.5 (First Structure Equation). Let {€;} be a frame on M* with dual frame {6'} and 
connection coefficients {w‘}. Then 


di =-Joaj nd’. 
i 


Proof. We will compute both sides of the equation on vector fields in the frame. 


d6'(E;, E,) = —6' (Ej, E-)) 
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since 6'(E;) = 5 has zero derivative. Then, by Proposition 8.3.2 (v), 
d0'(E;, E,) = —8' (Ve,E, — Ve €;) 
= —6' (VE,(E;) — VE;(E-)) 
-6' (x wE)E — ose08) 
! ! 

= w/ (E,) — w4(€)) 

since 6(€)) = 6;. Now we compute the sum of wedge products. 
Dia AOE; E) = D1 (w(ENOE,) ~ of (E,)9'(Es)) 
! ! 
= w)(E;) — of (E,). 


Because these calculations are the same on a basis, the general formula follows. O 


Now let’s relate these ideas to Riemann curvature. We write, as usual, R(&j, €j)En = 
Dr Rim Er and we compute 


R(Ei, Ej)Em = Vie, e;jEm + Ve, VeEm — Ve, VeyEm 
= VEm (Ei, Ej) + VIVEm(EIME)) — VV Em(EME) 


= E oh, (Eis ENE + V (x ont) (E;)-V > one) (Ei) 
= Lith (Ei, Ej DE + DE (E\)Es + E) Late; )E, 

— Dion (ENE — ven (Ej Let (ENE, 
=-)) [Ewhi) — €j05,(E)) — of (Ei, ED] Es 


+> be 5, (E;)@5 (Ej) — ©, (Ej )@% «) E, 


=> » (dont €j )- dt Aa E, 7) E, 


=-)> ( (1 — oo}, not) £9) 2 


If we let 27, = dw!, — )), w, A wi, then we obtain 


Rt, = 2 (Ej, Ej). 


ijm 


434 8. A Glimpse at Higher Dimensions 


The matrix of 2-forms 2 = (Q/,) is known as the curvature 2-form. Now, we could have defined 
Q by the formula R;,,, = —92),(E;, €;) and then we would have 


Theorem 8.6.6 (The Second Structure Equation). Jf 2 denotes the curvature 2-form and wy, are 
the connection |-forms determined by V, then 


7 = dy, th AQ. 


Whether we define Q2 in terms of R or derive the relationship, we see that we can determine 
curvature by combining the exterior derivative and wedge product of the connection 1-forms. 


Example 8.6.7 (Gauss Curvature by Forms). 
Suppose M: x(u, v) C R? is a surface patch with x, -x, = E, x,-x, =0 and x,-x, =G. 
Define a frame by 


na 6! and 6? be the dual frame having 6'(E;) = B. Hence, in terms of the coordinate 1-forms, 
= JE dx, and 62 = JG dx,. Then 


E E. 
do! = —~ dx, Adx, + —=dx, Adx, 
2VE 2VE 
E, 2 
= dx, A@ 
2VEG 
since dx, A dx, = 0 and dx, = 6?/J/G. Similarly, 
G 
de? = —-—*~= dx, n6'. 
2JEG * 
Now, if we write w} = a dx, + bdx,,thenw? = —adx, — bdx, andd@! = —adx, A 6? and 
dé? = bdx, A 6! (remembering the minus sign in the First Structure Equation). This means that 
E, G, 


= and b=- . 
2VEG 2VEG 


ae). dx, Ads, 
je 6 Ae? 


E, Gu 
2JEG* 2VEG 
(2 sender 
-] E, 
= saa \(Jea), * (yee 
since 0'/./E = dx, and 6*//G = dx,. Therefore, 
do, = Ko' re, 


Hence, w, = dx, and 


where K is Gauss curvature, by Theorem 3.4.1. Note that we then have dw? = —K 6! A 6? as 
well. Of course, we have met the 1-form w} before in Chapter 6, but there we confined it to be 
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along a curve so that it could be thought of as a function. Here its true nature as a 1-form is 
revealed. The formula above, which is surely one of the most beautiful in all of Mathematics, 
also explains results such as Exercise 6.3.10. (Why?) 


Example 8.6.8 (Gauss Curvature of the sphere S* via Forms). 
We have E = R? cos* v, F = Oand G = R’ for the usual patch on S?. A dual frame is then given 
by 


6'=Rcosudx, and 6*=Radx,. 
The exterior derivatives are 


do! = sinvdx, A6? and d6*=0. 


Hence, w} = — sin v dx, and we have 
dw} = —cosvdx, Adx, 
COS U 
=—— 6 Ad? 
R* cos vu 
1 
== 8' Ae. 
R2 


Therefore, comparing this result with the formula dw} = K 6! » 67, we obtain K = 1/R? just 
as we should. 


Exercise 8.6.9. Compute K for other surfaces using the formula dw} = K 6! a 6”. 


In modern geometry and physics, differential forms play an important role, both conceptually 
and in terms of calculation. For a classical approach to forms, see [Fla89]. The reader will also 
find many applications to physics there. A newer treatment is given in [Dar94], where forms are 
applied to understand modern gauge theories. For an approach to surface geometry via forms, 
see [O’N66]. 
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Hints and Solutions to Selected 
Problems 


Chapter 1: The Geometry of Curves 


Exercise 1.1.2 Let p = (—1, 0, 5) and q = (3, —1, —2) and substitute in the equation a(t) = p + t(q — 
P). 


Exercise 1.1.3 a(t) = (—1+1t,6— 5t, 5 — 8t, 98). 


Exercise 1.1.4 The vector (v;, v2, 0) is the hypotenuse of the right triangle whose sides are the vector 
(v,, 0, 0) and a translation of the vector (0, v2, 0). The vector (v,, v2, v3) is the hypotenuse of the right 
triangle whose sides are the vector (v,, v2, 0) and a translation of the vector (0, 0, v3). 


Exercise 1.1.6 135° intersection (or 45° if you prefer). 


Exercise 1.1.25 Consider a(0), a(Z), a(z), and a(34), What is the distance of each of these points from 
the origin? Also note that a(t) - a’(t) = 0 and that a(t) = —a(t). 


Exercise 1.1.26 Start with the basic parametrization a(t) = (rcost,rsint) and recognize that t = 0 
should correspond to the point (r + a, 6) in the xy-plane. Similarly, t = 5 should correspond to the point 
(a,r +5). 


Exercise 1.1.27 a(t) = (acos(t), b sin(t)). 
Exercise 1.2.2. Use the chain rule on £(s) = a(h(s)). 


Exercise 1.2.5 Show that ja’(t)| = r; s(t) = rt; and f(s) = =. Use the definition B(s) = a(t(s)) to obtain 
B(s) = (r cos £,7 sin £). 


Exercise 1.2.6 Show that |a’(t)| = Va? sin? t + b? cos? t. Is s(t) = if |a’(u)|du integrable in closed form? 


Exercise 1.3.2 Using the fact that T - e, = |T||e,;|cos@ and taking derivatives of both sides, first show 
that«N -e, = —sin8 @. There are two normals to 8 at any point on the curve, so we have two cases: (1) 
N = M, where the angle between N, and e, is 6 + 5; and (2) N = No, where the angle between N) and e; 
is 5 —6. UsexN -e; = — sin@ 22 and the definition of the dot product to show that, forN = N,,x = 2% 


@ 
40 ds ds? 
and for N = N2,k =—%. 
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Exercise 1.3.3 Recall that cofactor expansion gives 


i j é v v v v v v 
2 3]; I 3]; 2 
v| v2 WY/= i- jt k. 
W2 W3 Wy W3 Ww, W2 
3 


Exercise 1.3.4 Recall that interchanging two rows of a determinant changes the sign of the determinant. 
What does this imply about w x v and v x w? 


Exercise 1.3.5 Try Maple. 
Exercise 1.3.6 Rewrite Lagrange’s Identity as 


ju x wl? = Jul?{w|? — |v|*|wl* cos? 


and simplify the right-hand side to obtain the desired result. 


Exercise 1.3.8 The area of a parallelogram is given by bh, where 5 is the length of the base and h is the 
altitude. In the parallelogram spanned by v and w, h is equal to |v| sin@, where @ is the angle between v and 
w. 


Exercise 1.3.11 (1) Show that B’(s) = (&. =, 2), 


(2) Note that T’ = p” = (ae — 0). Also, since T’ = KN, |T’| = |k|[N| = x. 
(3) Since T’ =KN,N = r. 
(4) B’ = —tN, so |B’ = [ —t||N|, or |B’] =|—t] =. 


Exercise 1.3.22 Use the fact that p = B(s) + r(s)B’(s) for some function r(s). Differentiate both sides to 
obtain 


(l+r(s))T +r(s)B"(s) = 0. 


Take the dot products of both sides with T to establish that r(s) # 0. Take the dot products of both sides 
with 6” to get the contradictory statement r(s) = 0 — unless 6” = 0. This then says that £ is a line. 


Exercise 1.3.23 Leta(s)— p=aT +bN-+cB sothat T- (a — p)=a,N -(a— p)=b, and B- (a — 
p) = c. Recognize that (a — p)-(a ~ p) = R? since a lies on a sphere of center p and radius R. Take 
derivatives of both sides of this equation to obtain an expression for T - (a — p). Then take derivatives of 
both sides of T - (a — p) = a to obtain an expression for N - (a — p). Finally, take derivatives of both sides 
of N -(a@ — p) = b to obtain an expression for B - (a — p). 


Exercise 1.3.24 Use the previous problem. Let the constant be R? and show thata + +N + 4(4)'Bisa 
constant. 


Exercise 1.4.4 If the road is not banked, a”(t) can be resolved into two components: (1) tangential 
acceleration = 42 5, T(t) = 0 since the car is traveling at a constant speed; and (2) centripetal acceleration = 
Ny (t). By Newton’ s Law, the magnitude of the force due to centripetal acceleration is |mxv?N(t)| = 
mkv’, which must be balanced by the force due to friction, mg. If the road is banked, there are three 
primary forces acting on the car: (1) a downward force mg due to gravity; (2) a corresponding normal force 
exerted by the road; and (3) a kinetic frictional force preventing the car from flying off the road. The static 
frictional force preventing the car from sliding downward is negligible. 

Recall from physics that | f| = u|N|, yielding f, = wN, and f, = wN,. Summing the vertical forces 
on the car yields 


N, =mg+t+ fy =mge+un,. 


The total of the horizontal forces, f, + N,, produces the centripetal acceleration, so we have 


mi? = ti a N, om uN, + Ny. 
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Solve simultaneously for NV, and N, to obtain 


m 2 
Ny = fem — 48] 


m 
Ny = Tale + mew", 


An expression for tan @ may now be obtained. Solve this expression for v and obtain 


< | sitan9 + 4) 
~\ «(1 — tang)’ 
Jo’ xa" 


Exercise 1.4.6 Use x = “EP for both parts of this problem. In the case of the general plane curve 
a(t) = (x(t), y(t)), we have a’(t) = (x’(t), y(t) and a”(t) = (x(t), y(t), leading to: 


_ Oy"O=*"Oy'O 
(y+ O(n]? 


Exercise 1.4.9 As part of Exercise 1.4.8, we have |a’(t)| = 2 cosht. 


Exercise 1.5.2 Since y(s) is not necessarily a unit speed parametrization, use x, = pas a 
(1) Show that y’ x y” = «KB —xcos@(u x N). 
(2) Show that u x N = cos@B — sin@T. 
(3) Show that |y’ x y”| = « sing. 
(4) Show that jy’| = sin @. 


Exercise 1.5.3 (=) 8 acircular helix > y a circle > «, is constant. A circular helix is a special case of a 
cylindrical helix. Thus, T - u = cos @ is constant. What do these results imply about « = «, sin? 9? Finally, 
use the fact that for a cylindrical helix, £ is a constant. 

(<=) t and « constant = + = cot@ is constant. Use this to show that «, is constant. Also show that 
t, = 0 and conclude that y is a circle. 


Exercise 1.5.4 Use the fact thatx = 1 = = 5; (Bxercise 1.3.11) to show that cot @ = = is a constant. 
Exercise 1.5.5 Use eel to compute x = ia Use eee to compute rT = -i. What is true if both r 


and x are constants? 

Exercise 1.5.6 Prove that + constant < 4b* = 9a? by using MAPLE. 

Exercise 1.5.7 WLOG assume that 6 has unit Spee Show that © fis} -u = Oor, equivalently, 1(T'(s) ‘Uy= 
0. Use the fact that (T(s)-u)’ = T'(s)-u+T(s)-u' =T'(s)-u since u is a constant vector. 


Chapter 2: Surfaces 


Exercise 2.1.1 (=) If x, and x, are linearly dependent, then x, = cx, where c is a scalar. Use the fact 
that x, x x, may be expressed as a determinant and use properties of determinants. 
(<=) Xy X X, =O => |x,||x,| sin? = 0. What does this imply about 6, the angle between x, and x,? 


Exercise 2.1.6 Use a Monge patch, x(u, v) = (u, v, u? + v”); determine the u-parameter curve, x(u, vo), 
and the v-parameter curve, x(uo, v). Note that each of the parameter curves lies in a coordinate plane of R?. 


Exercise 2.1.8 x(u, v) = (u, v, +V1 — u? — v2). 


Exercise 2.1.19 A ruling patch for a cone is of the form x(u, v) = p + vd(u) where p is a fixed point. 
Let p = (0, 0, 0) as the cone emanates from the origin. The line that is to sweep out the surface must thus 
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extend from (0, 0, 0) to a point on the circle (a cos u, a sinu, a) lying parallel to and above the xy-plane. 
(The z-coordinate of any point on the circle is a because z = \/x? + y? = »/a* cos¢ u + a’ sin? u = a.) 

A ruling patch for a cylinder is of the form x(u, v) = B(u) + vg where q is a fixed direction vector. The 
directrix B(u) for a standard cylinder is the unit circle in the xy-plane (cos u, sinu, 0). We want a standard, 
right circlular cylinder, so g = (0, 0, 1). 


Exercise 2.1.20 To show that a surface is doubly ruled, we need to identify two ruling patches for the 
surface. Since z = xy = f(x, y), we can use a Monge patch x(u, v) = (u, v, uv) and write x(u, v) in terms 
of B(u) = (u, 0, 0) and 5(u) = (0, 1, u). Alternatively, y(u, v) = (v, u, vu) is also a patch for the surface. 


Exercise 2.1.21. A patch for the helicoid is x(u, v) = (0, 0, bu) + v(acosu, asinu). 


Exercise 2.1.22 A directrix for the hyperboloid of one sheet is the ellipse B(u) = (a cos u, bsinu, 0). Let 
5(u) = B'(u) + (0, 0, c). It can be shown that x(u, v) = B(u) + vd(u) is indeed a patch for the hyperboloid. 
Alternatively, let B(u) be as above and let 5(u) = B’(u) + (0, 0, —c) and verify that this, also, is a patch for 
the hyperboloid. 


Exercise 2.2.5 By definition, 


d 
[fg] = oes lr-o= Vfg(p)-v. 


a aL + 38 £ f. Finally, collect like terms to 


rae eae Cer 


obtain the desired result, fel = = = uf lg + fofg]. 


Exercise 2; 2.6 View x asa function f(p), po, p3) and write v = (v1, v2, v3). Thus, by definition, v[x] = 


or : a, Sos) M1, U2, V3). But since x(p1, p2, P3) = Pris - = = =Oand & = = 1. A similar procedure 
may be used for v[y] and v[z]. 
Exercise 2.2.11 Let a(¢) = x(ai(0) a3(0), B(O) = x(bi(0), ba) with a0) = p = B(O) and a/(0) = v, 


B'(0) = w. Then if y(t) = x((a,(2t) + by (2t))/2, (aa(28) + b2(28))/2), ¥'(D) = tt du 4 8 = x (a/(2t) + 
bj (21) + x,(a,(2t) + b3(2r)). Finding a’(t) and f(t) and substituting yields y’(0)=v+w. Thus, 
(v+w)[f] = yO] = VE - y’(0). By using o[ f] + w[f] = Vf -v + VS - w, show that(v + w)[f] = 
off] + w[ fl. 


Exercise 2.3.4 To compute the eigenvalues of a matrix S, we set det(A/ — S) = 0. This yields, in the case 
of the 2 x 2 symmetric matrix 


the equation 4? — (a + c)A + ac — b? = 0. Solve for A, and A, and show that they are both real. 


Exercise 2.4.4 For the first part, just use S(a’) = —V,-U. For the second (which is also an if and only if), 
show that both S(@’) and a’ are in PM Tay)M for each ¢. 


Chapter 3: Curvatures 


Exercise 3.1.2 Since K = k,k2, k, and ky must be of opposite sign. Because k)(u) is defined to be the 
maximum curvature, k;(u) > k2(u), sok, > 0 and k, < 0 is the case here. 


Exercise 3.1.6 (1) Euler’s formula states that k(u) = cos? 6k, + sin? @k, where u = cos@u, + sin@uz 
(i.e., u is a function of 6). Thus, 


2n 1 2n 
=f k(0)d@ = — (cos” Ok, + sin? 0k2)d0. 
0 


Evaluate the integral, remembering that k, and kz are constants. 
(2) Express v, and v; in terms of u; and u2. That is, v) = cos pu, + singu and v2 = cos(@ + F)u; + 
sin(g + 5 )u2. Use Euler’s formula to obtain k(v,) and k(v2). 
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Exercise 3.1.10 M minimal > H(p) =0 for every p € M. What does this imply about &, and k, and, in 
turn, about K? 


Exercise 3.2.6 Computing (E'G' — F')? directly, setting it equal to |x! x xj? =(1-2Hr+ 
Kr?y(EG — F*) and equating coefficients of powers of ¢ produces the identities: 
2 


(1) (-2K +4H2)EG =G (¢ + a) 4E (¢ 2), 


(2) 4HK)EG =2n(S+2)+e(S + 2) —8m2H; 

@) REG =(E+8)(S+ ) — 4m; 
Now use these in the formulas for H’ and K‘ to establish the mean and Gauss curvatures for parallel 
surfaces. 


Exercise3.2.18 (a) Compute x,,x,,andx, x x, toobtainU = BxS+08*3 Show that (U ‘Ly = rae 
To do so, you will need to recall that a -(b x c) = —b-(a x c). Use the Lagrange Identity to show that 
Ix, x xy? = EG — F? = W?. Finally, show that K = =%-*0" and combine with the above. 

(b) A ruling patch for the saddle surface is x(u, v) = (u, 0,0)+ v(0, 1, uw). Then B(u) = (u, 0, 0) and 
5(u) = (0, 1, u), Use the results of (a) to obtain K = eae 

(c) Note that B(u) = (p), po, p3) so that B’(u) = (0, 0, 0). 

(d) Note that 5(u) = (q;, 42, g3) so that 5’(u) = (0, 0, 0). 


Exercise 3.2.20 For one direction, note that U cannot depend on v only when the term v(d’ x 6) = 0. 
Thus, 5’ x 6 = 0 and the formula for K of a ruled surface shows K = 0. For the other direction, note that 
U, = —S(x,) is a tangent vector. Show that U, - x, = 0 (automatically!) and U, - x, = 0 by the hypothesis 
K = 0 (and the formula for K ofa ruled surface). 


Exercise 3.2.23 If 8 is a line of curvature, then £’- U x U' = B’-U x cB’ = 0 (why?). For the other 
way, show that developable implies that U’ is perpendicular to 6’ x U which is also perpendicular to £’. 
Then note that all these vectors are in the tangent plane. 


Exercise 3.2.26 (a) and (b) are self-explanatory. In (c), use the results of part (b) in the expressions 
K= nn and H = ee and simplify. For part (d), recognize (using results of (c)), that D is the 
numerator of K evaluated at the critical point (uo, vo). Since the denominator of K is always positive, 
D=0>kK=0,D0D<025 K <0,andD>0= K > 0. What must be true of k; and k) when K = 0? 
When K < 0? What do these results imply about the surface? Two sub-cases correspond to K > 0. If 
Fuu(Uo, Vo) is positive, k, and k, must both be positive. If f,,(uo, Vo) is negative, k, and k, must both be 
negative. What must be true of the surface in each of these sub-cases? 


Exercise 3.3.4. F =m =0 for a surface of revolution. Thus, x, and x, are orthogonal and we can 
express S(x,) in terms of the basis vectors x, and x,. Thus, let S(x,) = ax, + bx,. Compute S(x,) - x, and 
recognize that this is equal to /. Compute S(x,)-x, and recognize that this is equal to m. Similarly, let 
S(Xy) = cx, + dx, and take dot products with x, and x,. 


Exercise 3.3.7 (a) Derive 


l-w? 


i d+ we 2 
by using the expression for K for a surface of revolution. Algebraically determine when K > 0, K = 0, 
and K < 0. Part (b) is similar; parametrize the ellipse as a(u) = (R + acosu, bsinu, 0) to produce the 


following patch for the elliptical torus: x(u, v) = ((R + acosu)cosv, bsinu, (R +acosu)sin v). Using 
the expression for K for a surface of revolution yields 


[ ab? cosu 
~ (R +acosu)(b? cos? u + a? sin? uw)?” 
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Exercise 3.3.11 By separating variables, we obtain the expression 


[i 
u=-f je ah 


Make the substitution h = 1/coshw to obtain u = f tanh? wdw. Integrate, then use the fact that 
cosh™'(1h) = In(t + ,/4 — 1) and recall that tanh x = e* — e*e* + e~*. Simplify to obtain u = In|i + 


Viv) _ Bic. 


h 


Exercise 3.4.4 To verify the expression for U,, recall that a u-parameter velocity vector applied to a 
function of u and v takes the u-partial derivative of that function. Thus, 


V,,U = (xu [41], Xy[u2], Xy[u3]) = U,. 


Then, since x, and x, form a basis for 7,(M) and since V,, is in T,(M), we have V,,U = Ax, + Bxy. 
Take the dot product of both sides to obtain V,, U - x, = Ax, -x, = AE. Also recognize that 0 = x,[0] = 
xy [U - xy] = Vi,U +x, + U + xy, and use this to show that A = —//E. Proceed in a similar manner to find 
B and to obtain an expression for U,. 


Exercise 3.4.5 By finding the two partial derivatives (#2) and ($+), we obtain, as an equivalent 
expression for the right-hand side, 
E,G, 2GE,  E,Gy E,E, 2GGyy,  GuGy 
4E°G 4G°E 4G°E 4E°G 4G°E 4G?E- 
Combine these terms over the common denominator 4E?G?. Next, compute 
a ( E, ) — Bigs BBG 
av\ VEG) VEG 2%EG)i 


— EE,G,X(EG)? 


and 


VEG) JEG 2EG)i 2(EG)?- 


Substitute into the given expression and write the result over the common denominator 4E?G? to obtain the 
same expression as above. 


a ( G, — Guu G,E,G  G,EG, 
ou 


Exercise 3.4.6 A patch for a sphere of radius R is given by the formula x(u, v) = (Rcosu cosy, 
Rsinucosv, Rsinv). Compute x,, x,, E, and G and substitute into the given expression to obtain 
K =1/R?. 


Exercise 3.5.1 Since S, is a linear transformation from 7,(M) to itself, we can write S(x,) = Ax, + Bxy. 
But since p is an umbilic point, we have S(x,) = kx, => B = 0. Now use 


l= S(x,)-%, =AE+4+ BF, m= S(x,)-x» =AF+BG 


and solve for B to get B = 5448" = 0, so —Fl + Em = 0 orl/E = m/F. Do the same for S(x,). 


Exercise 3.5.9 A patch for a surface of revolution is given by x(u, v) = (u, h(u) cos v, h(u) sin v), Then 
- 


K= eT 0 => —h" = 0. Thus, h(u) = Cyu + C? (aline). Note that, ifC, = 0,a cylinder is generated 


by revolving A(u) about the x-axis; if C; 4 0, a cone is generated. 


Chapter 4: Constant Mean Curvature Surfaces 


Exercise 4.2.3 Use a Monge patch x(u, v) = (u, v, f(u, v))to obtain f, = g'(u), fuu = 84), fy = A'(v), 
fav = 9, and foy = h’(v). Then 


H=08 (1 +h) 9"(x) + (1 + 87(x))A"(y) = 0. 
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Separate variables to obtain 
—s"(x) _ hy) 
1l+g9%(x) 14h7(y) 
Since x and y are independent, each side is constant relative to the other side. Thus, let 


_ 78") 

1 +9(x) 
Also let w = g’(x) so that 2”(x) = 4 and integrate to obtain w = dg = = —tanax. Integrating again gives 
g(x)=1 guncos ax). Apply the same Freaconiiie to the other side of the ori iginal differential equation to obtain 
h(y)=—-; 4 In(cos ay). Combining terms yields 

1 cos ax 

Fes. y) = Zin ). 
a cosay 


Exercise 4.2.7 “= = (p’ x 5y - 8’ + (p’ x )- 8”. Both terms in this expression are zero. 
Exercise 4.3.3 Compute 
OP _ (fawV + faVul + fe + 0) — Vit fau = V Su fo fav 
au” a++ (2) 


and 


aQ — (fw + foV. + f2 + f2)- VEG fie Vit Sas 
av (1+ f2+ #2)! 


Then apply Green’s Theorem: 
[ [e+ Ba av= f Pdv—Qdu. 


Exercise 4.4.1 Compute partial derivatives to obtain 
—+—=-V,-(Uxx,)+V,-(U x x,)+V-[U, x x, — Uy X xy). 
v 
Since U isa function on M, we have U, = V,,U = —S(x,) and U, = V,,U = —S(x,). Substituting in the 


above equation and be 


a 
an steal “UU xxy)+V,-(U Xxy)+ V -(2Hx, X Xy). 


Now apply Green’s Theorem. 
Exercise 4.6.2 For Cauchy-Riemann, 2? = x? — y* + i2xy, so 


ag, ap a ay 
ax t~*éY ay 4 ax” 


Thus, f(z) = z* is holomorphic and f‘(z) = 2x +i2y = 2z as it should. 
Exercise 4.6.8 

= ow 2a 
az =3 ($+ a ei av 


ee bs set A dE 
-3($ tie +i | 
0 


by Cauchy-Riemann. 
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Exercise 4.8.10 The calculations are exactly as in Example 4.8.9 except that an extra factor of i occurs in 


each term. This affects the real parts to produce x! = sinhu sin v, x? = —sinhucosv and x? = v; thus, a 
helicoid. 
Exercise 4.8.29 M is minimal with isothermal coordinates, so / = —n and, consequently, 
ae In-m — —-? —m P +m? 
~ EG—F2 E2-0 — E? 


Exercise 4.8.30 A conformal Gauss map implies U,,- U, = 0. Plugging in the usual expressions for U, 
and U,, we get 0 = mH. Now consider the two cases, m = 0 and H = 0. 


Chapter 5: Geodesics, Metrics and Isometries 


Exercise §.1.12 A parametrization of a right circular cylinder is given by x(u, v) = (Reosu, Rsinu, bv). 
Then a curve on the surface is given by a(t) = (Rcosu(t), Rsinu(t), bu(t)). Find a” by differentiating 
d 


twice and noting that the chain rule gives + cosu(t) = — sinu%, Now, a” = ay, + (a” -U)U, where 


U in this case is (cosu, sinu, 0). Taking the dot product of a” with U yields a”. U = —R(#). From 
this and from a” = a, + (@”-U)U, we know that aja, = 0 results in a = 0 and oy = 0, yielding 
u(t) = kit +c; and v(t) = kot +2. Thus, a(t) = (Rcos(kit + ¢;), R sin(kyt + c)), b(kot + c2)). Finally, 
consider the following cases: (1) c) = cz = 0, ky) = kp = 1; (2) ky = 1, k2 = 0, ¢; = 0,c2 #0; (3) k; = 1, 
k, =0,¢; = 0, c¢ = i 
Exercise 5.2.12 

VG sing = VG cos(/2 — 6) =x, -a = Gu' 


since a’ = x,,u' +x,v’. Now use the relation v’ = c/G derived from the second geodesic equation. 


Exercise 5.2.13 In polar coordinates, a patch for the plane is given by x(u, v) = (u cos v, u sin v). Compute 
E=1, F =0,and G = uv’, verifying that x is u-Clairut. Then we have 


t cVE is cdu 
v(up) = Ga a 
uy VGVG —c uy UVu? — c? 


Integrate using the substitution u = c sec x > du = csecx tanx dx to obtain v(u) — v(uo) = + cos7! £,or 
ucos(v — vp) = c, the polar equation of a line. 


u(u) 


Exercise 5.2.14. Compute E = 2, F = 0, and G = u’, verifying that the patch for the cone is u-Clairut. 
Then 


H0) = vu) = [eV EVE VG = Adu = aE 


Integrate using the substitution u = c sec x to obtain v(u) — (uy) = V2 sec™! 4 


ui 


Exercise 5.4.6 Compute E = 1 / (1 — 2/4), F =0,and G = / (1 —u2/4)?. Then, K = 


~3766 (ao (zea) * a (ye8)) = ~ayee (a (Ze) 


Find the required derivatives and work through the algebra to obtain K = —1. 


Chapter 6: Holonomy and the Gauss-Bonnet Theorem 


Exercise 6.1.4 A patch for the torus is given by x(u, v) = ((R +rcosu)cosv,(R+rcosu)sinv,rsinu). 
Find x,, Xy, and compute |x, x x,| = r(R +rcosu). Then 


2n 2n 
SA= ‘, i r(R+rcosujdudu = 4n’rR. 
0 0 
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Exercise 6.1.5 (1) A patch for a surface of revolution is given by x(u, v) = (u, A(u) cos v, h(u) sin v). 
Find x,, x), and compute |x, x x,| = A(u)(1 + h?(u))?. Recognize that for a surface of revolution, h(u) is 
usually written as f(x). Use the expression for surface area to complete the exercise. (2) Define a Monge 


patch x(u, v) = (u, v, f(u, v)). Then |x, x x,| = /1 4+ f? + f?. 
Exercise 6.1.8 A patch for the bugle surface (with c = 1) is given by 


x(u, v) = (u — tanhu, sech ucos v, sech u sinv). 


Compute |x, x x,| = (sech* u tanh? u + sech? u — 2sech* u + sech® u)? and simplify using the identity 
tanh? u = 1 — sech’ u, obtaining |x, x x,| = sech u tanhu. Then 


2x 
sa= | i, sech utanhudvudu = 27. 


Exercise 6.1.9 (1) Total Gaussian curvature is given by 


cosu 
[x SR eer |Xy X Xy|dudv, 
r(R +rcosu) 


for the torus, where |x, x x,] =r(R+rcosu). (2) A patch for the catenoid is given by x(u,v) = 
(u, coshu cos v, coshu sin v), yielding |x, x x,| = cosh*u. Also, K = -1/ cosh? u. Integrate fe K to 
show that the total Gaussian curvature is —47. 


Exercise 6.3.1 a’[V-V] = 2V,V-V =0 since V is parallel. 


Exercise 6.3.3. a’ [V-W]=VywV-W+V- Vy W. V and W parallel implya’[V - W] = Oand V parallel, 
a’[V -W] =0, a'[W - W} = 0 imply Vy W is perpendicular to both V and W ina plane. Thus, V,,W = 0 
and W is parallel. 


Exercise 6.3.10 For the R-sphere, K = 1/R? and |x, x x,| = R? cos v. Integrate f,, K to show that the 
total Gaussian curvature above up is 27 — 27 sin vg. We know that the holonomy around the vo-latitude 
curve is —27 sin vg. Thus, the holonomy around a curve is equal to the total Gaussian curvature over the 
portion of the surface bounded by the curve (up to additions of multiples of 277). 


Exercise 6.3.11 At the Equator, vp = 0. What is the holonomy along the Equator and what does this imply 
about the apparent angle of rotation of a vector moving along the Equator? What does this signify about the 
Equator? 


Exercise 6.4.4 But, of course, gravity really doesn’t point that way on a planetary torus, does it? 


Exercise 6.5.5 The vector must come back to itself, so the total number of revolutions it makes is a 
multiple of 27. 


Exercise 6.5.10 Note that the Gaussian curvature for both H and P is a constant K = —1. Thus we have 


[x=- [ =~aa of A. 
& & 


But, since the sum of the interior angles of the triangle differs from = by (+ or —) the total Gaussian 
curvature, we have 
Li; —m = —area of A. 


What does this imply about the sum of the angles, noting that area is a strictly positive quantity? 


Exercise 6.6.7 If K <Qand K < 0 at evena single point, then the total Gauss curvature is negative. But 
the Euler characteristic of the torus is zero. 


Exercise 6.7.3. A disk has Euler characteristic 1. 


Exercise 6.8.17 Their curvatures are not bounded away from zero. 
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Chapter 7: The Calculus of Variations and Geometry 


Exercise 7.1.9 


d f . of OF Fg OF of 34 Ff 
_ —-x— Jae H+ K+ 4% -F— - KS 
dt 0x ot ax Ox 0x dt 0x 
(af d af 
proto 
v +i(Z aa) 


=0 
if and only if 4 — 4% = 9. 
Exercise 7.1.13 x(t) =f —sinf. 
Exercise 7.1.15 x(t) = sin¢. 


Exercise 7.3.3 The Euler-Lagrange equation for the time integral 
1 12 
T= i VEY ae 
ky 


is 


vity? _ a =c 
ky kyJ/1 + y”? 
Then 
1 — 
ky 1+ y? 
is separable with 
Y = 
Je=a dy = dx. 


The solutions are then (x — a)* + y* = c?, circles centered on the x-axis — the geodesics of the Poincaré 
plane! 


Exercise 7.5.3 x(t) =1 —sint+ b. 
Exercise 7.5.5 x(t) = csint. 


Exercise 7.5.16 


2 


1 1 
Essie St —xsint — 5p’ +x? —(%— p)p 


Exercise 7.5.18 


E=x—x?— p*+x?-(%- p)dp 
=x? — p?—2px+2p* 
= (i - py 


> 0. 
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Exercise 7.6.5 The Euler-Lagrange equation is 
d 
Pho tv 
with simplification ¥ = —A and solution 
x 
x(t) = aa +at+b. 


The initial conditions give a = 1/2 and b = 0. Applying the constraint, we obtain 


7 Af! d 
a--3f ?—rdt = — 
12 2 Jo 12 


so that A = 7. Finally, x(t) = —32? + Zr. 


Exercise 7.6.13 The equations of motion for the particle in the paraboloid are 
(14+ 4u?)i+ 4uv4+2u-uv?=0 b=. 


Exercise 7.6.17 T = m/2(Eu? + Gv’), so (forgetting m) 
E = 1/2(Ew + Gi? — Ep} — Gp) — (a — pi)Ep: — (6 — p2)Gpr 
Le 2) 5 
= 5(E ~ pil + GO — pr)’) 


> 0. 


Chapter 8: A Glimpse at Higher Dimensions 
Exercise 8.3.4 


[fV,gW] = fV[gW]— eW[fV] 
= fV[g|W + feVW — gW[f]V —-efWV 
= fel[V,W]+ fV[g]W —eWLf]V. 


Exercise 8.3.11 Assume x, - x, = 0 and take an orthonormal basis x, = x,,/ VE and x, = x, / JG. Then 


(7.3.)" = es (25) ) 2 (FE aePs) _ L 


I+E 
peta = 2H since F = 0. 
EG 


Similarly, (V5, %)” = n/G. Then, the sum is 
Exercise 8.5.17 If we take X/|X| as &, then €),..., € is a frame for M*. By definition then, 
k-l 
Ric(X, X) = (X, X) Ric(Ex, Ex) = (X, X) D> (REx, EE. Ej) 


j=l 


since (R(Ex, ExJEx, E) = 0. Then use the definition of sectional curvature. 
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Exercise 8.5.21 Let X = 5°, X/E; anddf = 5° dfj6/ where 6/ is a dual vector space basis element to 
E; (see §6) defined by 6/(€) = 8 (where 8) is the Kronecker delta). Then 


df(X)=df()) XE) = Vo X/afE) = o> Xa fie'(E) = DXi df. 


J joi j 


Also. we have by the definition of divergence, 


div(f (-, -)(X) = Dove: )Ej, X) 
= Tey (Ej, X)] — f (Ve, Ej, X) — FE}, Ve, X) 
i 
= DEMME) X) + FEME;, X) 
. — f (Ve, Ej, X) — F(E;, Ve, X) 
= DEMME). X) + f(Ve,E), X) + £ (Ej, Ve,X) 
j 


— £(Ve, Ej, X) — F(E;, Ve; X) 
=) EAFME) X) 


= Do af(é)x/ 
i 


=) df;Xx! 
i 
= df(X). 
Exercise 8.5.23 : ; 
K= Y > RiclE;, &) = SOME, &;) = ki. 
i=] i=l 


Thus, dk = kdd and we also have (by Exercise 8.5.21) 


2dd = 2div f(-,-) 


= 2 div(Ric) 
= dk by Theorem 8.5.22 
=kdh. 


Thus, (k — 2)dA = 0 and, since k > 3, we must have dA = 0. Hence, A is a constant. 


Suggested Projects for Differential 
Geometry 


The following are suggested projects for Differential Geometry. All problems come from the 
MAA Edition of Differential Geometry and its Applications, but some material from my book 
The Mathematics of Soap Films ({Opr00]) may also be required. 


Project 1: Developable Surfaces 


Developable surfaces are the special cases of ruled surfaces (those having a parametrization 
x(u, v) = B(u) + v d(u)) having zero Gauss curvature. These surfaces have industrial applica- 
tions. The project consists of: Exercise 3.2.20 — Exercise 3.2.25 and either of two choices: 


(1) Exercise 5.5.5, Exercise 5.5.7, Exercise 6.9.1 and the Maple procedure holounroll in 
Section 6.9 and Maple material in Section 5.6: 


(2) A synopsis of Section 5.7 together with the Maple work found there and that in Section 2.5 
for the tangent developable. For the synopsis, present the motivation and important points 
of the discussion as well as the most important calculations. 


Project 2: The Gauss Map 


The Gauss map is a mapping from a surface to the unit sphere given by the unit normal of 
the surface. The Gauss map can tell you a great deal about the surface. The project consists 
of: Exercise 2.3.9 — Exercise 2.3.12, Exercise 4.8.27, Proposition 4.8.28, Exercise 4.8.30 and a 
description of Schwarz’s theorem in The Mathematics of Soap Films giving a criterion for area 
minimization. Maple should be used to draw Gauss maps for various surfaces. 


Project 3: Minimal Surfaces and Area Minimization 


Minimal surfaces are those with mean curvature equal to zero at each point. Ifa surface minimizes 
area inside some boundary, it is a minimal surface. So soap films are physical manifestations 
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of minimal surfaces. This project focusses on some aspects of area minimization and minimal 
surfaces. It consists of: Exercise 2.1.16, Exercise 4.2.3, Exercise 4.3.5, Exercise 4.3.6 (the Maple 
approach from The Mathematics of Soap Films), Exercise 4.9.2, Exercise 4.9.3, Exercise 7.3.8, 
Exercise 7.3.9 and the Maple material in Subsection 4.9.5. 


Warning The following two projects do not have as many exercises associated with them, but 
they require learning about elliptic functions and explaining what you learn. So, do not think 
they are easy. 


Project 4: Unduloids 


One-celled organisms sometimes take the shape of unduloids. These are surfaces of revolution that 
arise from minimizing surface area subject to enclosing a fixed volume (read Theorem 7.7.15). 
Equivalently, unduloids are examples of surfaces of revolution with constant non-zero mean 
curvature. The project consists of: deriving the differential equation of Theorem 3.6.1, and using 
the equation to parametrize unduloids via elliptic functions. This material is in Section 3.7. Explain 
this material, carry out all Maple calculations and plot unduloids. Furthermore, do Exercise 3.7.4. 
Finally, plot geodesics on unduloids by carrying out the discussion in Subsection 5.6.4 (and using 
“halfbouncepoint”) and do Exercise 5.6.9. 


Project 5: The Shape of a Mylar Balloon 


Mylar balloons are found at children’s birthday parties. They are constructed by sewing together 
two disks of Mylar and inflating. The “sideways” shape is determined by a variational argument 
from the calculus of variations. The project consists of: explaining the calculus of variations 
derivation of the parametrization of the Mylar balloon using elliptic functions. Explain the 
material on elliptic function in Section 3.7 (including Exercise 3.7.4) and the material on the 
Mylar balloon in Section 7.9 (including Exercise 7.9.5). Create a “halfbouncepoint” procedure 
for the Mylar balloon similar to the one in Subsection 5.6.4. Finally, do Exercise 5.6.9. 
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parameter curve, 67 
parametrization, 1, 68 
by arclength, 14, 16 


reparametrization, 15 
by arclength, 15 
Ricci curvature, 420, 422, 424 
divergence of, 427 
for surfaces, 423 
Richmond’s surface 
WE representation, 191 
Riemann curvature, 417 


generalizes Gauss curvature, 418 


symmetries of, 419 
Riemannian connection, 280 
Ros’s theorem, 176 
roulette, 134, 135 
ruled minimal surface, 167 
ruled surface, 74, 117 


of line, 2 ruling, 75 
patch, 68 
Monge, 70 saddle surface, 75, 94, 116 


path connected, 69 
pendulum, 361 

and spring, 362 

double, 363 
periodic solution, 360 
plane evolute, 31 
Plateau’s problem, 170 
Poincaré plane, 228 

geodesic, 232 
Poincaré upper half space, 423 
pole, 299 


scalar curvature, 421, 423 
Scherk’s fifth surface, 170 
Scherk’s surface, 165 

WE representation, 191 

via Maple, 198, 201 
Schwarz inequality, 6, 341 
Schwarzschild solution, 429 
second Bianchi identity, 419 
second derivative test, 118 
second fundamental form, 407 

properties of, 407 


Pontryagin maximum principle, 368, 370 second structure equation, 434 
potential energy, 312, 323, 326, 351, 353, 358, 362, second variation, 343 


412 secondary variational problem, 343 
potential function, 338 sectional curvature, 418, 424 
principal curvatures, 93, 109 as Gauss curvature, 418 
principal normal, 18 shape operator, 84, 107, 108, 279, 406 
principal vectors, 93 and Gauss map, 110 
procedure, 43 as symmetric transformation, 88 
product rule, 6, 81 formulas, 108 
pseudosphere, 121-123 of cylinder, 85 
pursuit curve, 13 of hypersurface, 418 

of saddle, 85 
real part, 182 of sphere, 84, 407 
recreate procedure, 54 of torus, 85 
recreate3d, 55 zero, 85 
rectifying curve, 36 shortest distance, 4, 6, 302, 312, 324, 350, 370 
characterization of, 36 shoulder, 263 

regular, 3 Maple procedure for, 270 


regular mapping, 68 shrinkable curve, 287 


Index 


simplify correction, xvi 
smooth, 68 
soap bubble, 175 
soap film, 163 
space evolute, 36 
spacecurve, 44 
speed of curve, 3 
sphere, 84, 398, 401, 407 
Gauss curvature of 
from forms, 435 
sectional curvature of, 418 
spherical curve, 25 
spiral of Cornu, 38, 348 
spring-pendulum, 362 
state equations, 368 
stereographic plane 
geodesic, 234 
stereographic projection, 195, 398 
stereographic sphere, 229 
structure equation 
first, 432 
second, 434 
Sturm-Liouville theorem, 304 
subs command, 44 
surface, 68 
closed, 296 
compact, 70 
convex, 296 
non-orientable, 80 
of Delaunay, 133, 137 
of revolution, 72 
orientable, 80 
ruled, 74 
saddle, 75 
surface area, 164, 414 
surface of revolution, 119 
surface tension, 161 
suspension bridge, 11 
symmetric matrix, 88 


tangent developable, 102, 117 
tangent indicatrix, 64, 294 
tangent plane, 77-79 

tangent space, 398 

tangent vector field, 402 
tantrix, 64, 294 

taut string, 363 

tautochrone, 9, 61 
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theorem egregium, 124 
torsion 
geodesic, 282 
of curve, 20, 23, 29 
total, 93 
torsion of curve, 43 
torus, 73, 120 
flat, 229 
total Gauss curvature, 110, 277, 283 
total torsion, 93 
of curve, 22 
tractrix, 121 
transition map, 398, 399 
transversality condition, 318, 320 
triangle angle sum, 276 
trick to remember, 17 
tubeplot, 45 
twist, 22 
twisted cylinder, 105 


umbilic, 127, 193 
umbilic point, 95 
undetermined time problem, 318 
undulary, 136 
unduloid, 136, 142 
geodesic, 251 
mean curvature of, 147 
parametrization, 144 
unit normal, 79, 80, 82 
unit speed relation, 222 
unrolling, 236, 237 
upper half-plane, 228 


variation, 313, 414 

vector field, 81, 82, 337, 402 
parallel, 280, 411 

velocity vector, | 

Viviani’s curve, 26, 61 


wave equation, 364 

wedge product of forms, 432 

Weierstrass condition, 340 

Weierstrass excess function, 336, 341, 357 

Weierstrass-Enneper representation, 188, 189, 206 
of the bat, 207 

Weierstrass-Erdmann condition, 344 

Weingarten map, 84 

witch of Agnesi, 11, 45 
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| Second Edition | John Oprea 


Differential Geometry 
and Its Applications 


Differential geometry has a long, wonderful history. It has found relevance in areas 
ranging from machinery design to the classification of four-manifolds to the creation 
of theories of nature's fundamental forces to the study of DNA. This wide range of 
applications means that differential geometry is not just for mathematics majors. 


It is also an excellent course of study for students in engineering and the sciences, 


This book studies the differential geometry of surfaces with the goal of helping 
students make the transition from the compartmentalized courses in a standard 
university curriculum to a type of mathematics that is a unified whole. It mixes 
together geometry, calculus, linear algebra, differential equations, complex variables, 


the calculus of variations, and notions from the sciences. 


That mix of ideas offers students the opportunity to visualize concepts through 

the use of computer algebra systems such as Maple. The book emphasizes that this 
visualization goes hand-in-hand with understanding the mathematics behind the 
computer construction. Students will not only “see” geodesics on surfaces, they will 
also observe the effect that an abstract result such as the Clairaut relation can have 
on geodesics. Furthermore, the book shows how the equations of motion of particles 


constrained to surfaces are actually types of geodesics. 


The book is rich in results and exercises that form a continuous spectrum, from those 


that depend on calculation to proofs that are quite abstract. 
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